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COMPLEXES OF ALPHA (2) MACROGLOBULIN AND ANTIGENIC 

MOLECULES FOR IMMUNOTHERAPY 

This invention was made with government support under grant number CA64394 
5 awarded by the National Institutes of Health. The government has certain rights in the 
invention. 

1. INTRODUCTION 

The present invention relates to complexes of alpha (2) macroglobulin associated 
10 with antigenic molecules for use in immunotherapy. The invention relates to methods for 
using such compositions in the diagnosis and treatment of immune disorders, proliferative 
disorders, and infectious diseases. 

15 2. BACKGROUND OF THE INVENTION 

2.1. HEAT SHOCK PROTEINS 

Heat shock proteins (HSPs), also referred to as stress proteins, were first identified 
as proteins synthesized by cells in response to heat shock. Hsps have classified into five 
families, based on molecular weight, HsplOO, Hsp90, Hsp70, Hsp60, and smHsp. Many 

20 

members of these families were found subsequently to be induced in response to other 
stressful stimuli including nutrient deprivation, metabolic disruption, oxygen radicals, and 
infection with intracellular pathogens (see Welch, May 1993, Scientific American 56-64; 
Young, 1990, Annu. Rev. Immunol. 8:401-420; Craig, 1993, Science 260:1902-1903; 
Gething etal, 1992, Nature 355:33-45; and Lindquist etal, 1988, Annu. Rev. Genetics 
25 22:631-677). 

Heat shock proteins are among the most highly conserved proteins in existence. For 
example, DnaK, the Hsp70 from E. coli has about 50% amino acid sequence identity with 
Hsp70 proteins from excoriates (Bardwell et al 9 1984, Proc. Natl. Acad. Sci. 81:848-852). 
The Hsp60 and Hsp90 families also show similarly high levels of intra-family conservation 

30 

(Hickey et ai 9 1989, Mol. Cell. Biol. 9:2615-2626; Jindal, 1989, Mol. Cell. Biol. 9:2279- 
2283). In addition, it has been discovered that the Hsp60, Hsp70 and Hsp90 families are 
composed of proteins that are related to the stress proteins in sequence, for example, having 
greater than 35% amino acid identity, but whose expression levels are not altered by stress. 

Studies on the cellular response to heat shock and other physiological stresses 
revealed that the HSPs are involved not only in cellular protection against these adverse 
conditions, but also in essential biochemical and immunological processes in unstressed 
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cells. Hsps accomplish different kinds of chaperoning functions. For example, members of 
the Hsp70 family, located in the cell cytoplasm, nucleus, mitochondria, or endoplasmic 
reticulum (Lindquist et al, 1988, Ann. Rev. Genetics 22:631-677), are involved in the 
presentation of antigens to the cells of the immune system, and are also involved in the 
transfer, folding and assembly of proteins in normal cells. Hsps are capable of binding 
proteins or peptides, and releasing the bound proteins or peptides in the presence of 
adenosine triphosphate (ATP) or low pH. 
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2.2, IMMUNOGENICITY OF HSP-PEPTIDE COMPLEXES 

Srivastava et al demonstrated immune response to methylcholanthrene-induced 
sarcomas of inbred mice (1988, Immunol. Today 9:78-83). In these studies, it was found 
that the molecules responsible for the individually distinct immunogenicity of these tumors 
were glycoproteins of 96kDa (gp96) and intracellular proteins of 84 to 86kDa (Srivastava et 
al, 1986, Proc. Natl. Acad. Sci.USA 83:3407-3411; Ullrich etal, 1986, Prop. Natl. Acad. 
Sci. USA 83:3 121-3 125). Immunization of mice with gp96 or p84/86 isolated from a 
particular tumor rendered the mice immune to that particular tumor, but not to antigenically 
distinct tumors. Isolation and characterization of genes encoding gp96 and p84/86 revealed 
significant homology between them, and showed that gp96 and p84/86 were, respectively, 
the endoplasmic reticular and cytosolic counterparts of the same heat shock proteins 
(Srivastava et al, 1988, hnmunogenetics 28:205-207; Srivastava et al, 1991, Curr. Top. 
Microbiol. Immunol. 167:109-123). Further, Hsp70 was shown to elicit immunity to the 
tumor from which it was isolated but not to antigenically distinct tumors. However, Hsp70 
depleted of peptides was found to lose its immunogenic activity (Udono and Srivastava, 
1993, J. Exp. Med. 178:13914396). These observations suggested that the heat shock 
proteins are not immunogenic per se, but form noncovalent complexes with antigenic 
peptides, and the complexes can elicit specific immunity to the antigenic peptides 
(Srivastava, 1993, Adv. Cancer Res. 62:153-177; Udono et al, 1994, J. Immunol., 
152:5398-5403; Suto et al, 1995, Science 269:1585-1588). 

Noncovalent complexes of HSPs and peptide, purified from cancer cells, can be used 
for the treatment and prevention of cancer and have been described in PCT publications WO 
96/10411, dated April 11, 1996, and WO 97/10001, dated March 20, 1997 (U.S. Patent No. 
5,750,119 issued April 12, 1998, and U.S. Patent No. 5,837,251 issued November 17, 1998, 
respectively, each of which is incorporated by reference herein in its entirety). The isolation 
and purification of stress protein-antigen complexes has been described^ for example, from 
pathogen-infected cells, and used for the treatment and prevention of infection caused by the 
pathogen, such as viruses, and other intracellular pathogens, including bacteria, protozoa, 
fungi and parasites (see, for example, PCT Publication WO 95/24923, dated September 21, 
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1995). Immunogenic stress protein-antigen complexes can also be prepared by in vitro 
complexing of stress protein and antigenic peptides, and the uses of such complexes for the 
treatment and prevention of cancer and infectious diseases has been described in PCT 
publication WO 97/10000, dated March 20, 1997 (U.S. Patent No. 6,030,618 issued February 
29, 2000. The use of stress protein-antigen complexes for sensitizing antigen presenting 
cells in vitro for use in adoptive immunotherapy is described in PCT publication WO 
97/10002, dated March 20, 1997 (see also U.S. Patent No. 5,985,270 issued November 16, 



2.3. ALPHA (2) MACROGLOBULIN RECEPTOR 

The a-macroglobulins are members of a protein superfamily of structurally related 
proteins which also comprises complement components C3, C4 and C5. The human plasma 
protein alpha (2) macroglobulin (<x2M) is a 720 kDa homotetrameric protein primarily 
known as proteinase inhibitor and plasma and inflammatory fluid proteinase scavenger 
molecule (for review see Chu and Pizzo, 1994, Lab. Invest. 71:792). Alpha (2) 
macroglobulin is synthesized as a 1474 amino acid precursor, the first 23 of which function 
as a signal sequence that is cleaved to yield a 1451 amino acid mature protein (Kan et aL, 
1985, Proc. Natl. Acad. Sci. U.S.A. 82:2282-2286). 

Alpha (2) macroglobulin promiscuously binds to proteins and peptides with 
nucleophilic amino acid side chains in a covalent manner (Chu et aL, 1994, Ann. N.Y. Acad. 
Sci. 737:291-307) and targets them to cells which express the a2M receptor (a2MR) (Chu 
and Pizzo, 1993, J. Immunol. 150:48). Binding of a2M to the a2M receptor is mediated by 
the C-terminal portion of a2M (Holtet et aL, 1994, FEBS Lett. 344:242-246) and key 
residues have been identified (Nielsen et aL, 1996, J. Biol. Chem. 271:12909-12912). 

Generally known for inhibiting protease activity, a2M binds to a variety of proteases 
thorough multiple binding sites (see, e.g., Hall etaL, 1981, Biochem. Biophys. Res. 
Commun. 100(1):8-16). Protease interaction with a2M results in a complex structural 
rearrangement called transformation, which is the result of a cleavage within the "bait" 
region of a2M after the proteinase becomes "trapped" by thioesters. The conformational 
change exposes residues required for receptor binding, allowing the a2M-proteinase complex 
to bind to the a2MR. Methylamine can induce similar conformational changes and cleavage 
as that induced by proteinases. The uncleaved form of a2M, which is not recognized by the 
receptor, is often referred to as the "slow" form (s-a2M). The cleaved form is referred to as 
the "fast" form (f-a2M) (reviewed by Chu et aL, 1994, Ann. N.Y. Acad. Sci. 737:291-307). 

Studies have shown that in addition to its proteinase-inhibitory functions, <x2M, when 
complexed to antigens, can enhance the antigens' ability to be taken up by antigen presenting 
cells such as macrophages and presented to T cell hybridomas in vitro by up to two orders of 
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magnitude (Chu and Pizzo, 1994, Lab. Invest. 71:792), and induce T cell proliferation 
(Osada et al., 1987, Biochem. Biophys. Res. Commun. 146:26-31). Further evidence 
suggests that complexing antigen with <x2M enhances antibody production by crude spleen 
cells in vitro (Osada et al, 1988, Biochem. Biophys. Res. Commun. 150:883) elicits an in 
vivo antibody responses in experimental rabbits (Chu etal, 1994, J. Immunol. 152:1538- 
1545) and mice (Mitsuda etal, 1993, Biochem. Biophys. Res. Commun. 101:1326-1331). 
However, none of these studies have shown whether alpha2M-antigen complexes are capable 
of eliciting cytotoxic T cell responses in vivo. 

2.4. IMMUNOGENICITY OF HEAT SHOCK/STRESS PROTEINS 

Srivastava et al demonstrated immune response to methylcholanthrene-induced 
sarcomas of inbred mice (1988, Immunol. Today 9:78-83). In these studies, it was found that 
the molecules responsible for the individually distinct immunogenicity of these tumors were 
identified as cell-surface glycoproteins of 96kDa (gp96) and intracellular proteins of 84 to 
86kDa (Srivastava et al, 1986, Proc. Natl. Acad. Sci. USA 83:3407-341 1 ; Ullrich, S J. et al, 
1986, Proc. Natl. Acad Sci. USA 83:3121-3125). Immunization of mice with gp96 or 
p84/86 isolated from a particular tumor rendered the mice immune to that particular tumor, 
but not to antigenically distinct tumors. Isolation and characterization of genes encoding 
gp96 and p84/86 revealed significant homology between them, and showed that gp96 and 
p84/86 were, respectively, the endoplasmic reticular and cytosolic counterparts of the same 
heat shock proteins (Srivastava et al, 1988, Immunogenetics 28:205-207; Srivastava et al, 
1991, Curr. Top. Microbiol. Immunol, 167:109-123). Further, Hsp70 was shown to elicit 
immunity to the tumor from which it was isolated but not to antigenically distinct tumors. 
However, Hsp70 depleted of peptides was found to lose its immunogenic activity (Udono 
and Srivastava, 1993, J. Exp. Med. 178:1391-1396). These observations suggested that the 
heat shock proteins are not immunogenic per se, but form noncovalent complexes with 
antigenic peptides, and the complexes can elicit specific immunity to the antigenic peptides 
(Srivastava, 1993, Adv. Cancer Res. 62:153-177; Udono etal, 1994, J. Immunol., 152:5398- 
5403; Suto etal, 1995, Science, 269:1585-1588). 

The use of noncovalent complexes of stress proteins and peptides, purified from 
cancer cells* for the treatment and prevention of cancer, as well as the use of such complexes 
in combination with adoptive immunotherapy, has been described (see U.S. Patent No. 
5,750,199; U.S. Patent No. 5,830,464; Patent Cooperation Treaty ("PCT") publications WO 
96/1041 1, dated April 11, 1996; and WO 97/10001, dated March 20, 1997; each of which is 
incorporated by reference herein in its entirety. The purification of stress protein-peptide 
complexes from cell lysates has been described previously; stress protein-peptide complexes 
can be isolated from pathogen-infected cells and used for the treatment and prevention of 
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infection caused by pathogens, such as viruses and other intracellular pathogens, including 
bacteria, protozoa, fungi and parasites (see PCT publication WO 95/24923, dated September 
21, 1995). 

Immunogenic stress protein-peptide complexes can also be prepared by in vitro 
complexing of stress protein and antigenic peptides, and the uses of such complexes for the 
treatment and prevention of infectious diseases and cancer has been described in PCT 
publication WO 97/10000, dated March 20, 1997. The use of heat shock proteins in 
combination with a defined antigen for the treatment of infectious diseases and cancer have 
also been described in PCT publication WO 97/06821, dated February 27, 1997. The 
administration of expressible polynucleotides encoding eukaryotic heat shock proteins to 
mammalian cells for stimulating an immune response, and for treatment of infectious 
diseases and cancer has been described in PCT publications, WO 97/06685 and WO 
97/06828, both dated February 27, 1997. The use of stress protein-peptide complexes for 
sensitizing antigen presenting cells in vitro for use in adoptive immunotherapy is described 
in PCT publication WO 97/10002, dated March 20, 1997. 

2.5. ANTIGEN PRESENTATION 

Major histocompatibility complex (MHC) molecules present antigens on the cell 
surface of antigen-presenting cells. Cytotoxic T lymphocytes (CTLs) then recognize MHC 
molecules and their associated peptides and kill the target cell. Antigens are processed by 
two distinct antigen processing routes depending upon whether their origin is intracellular or 
extracellular. Intracellular or endogenous protein antigens, ie. 9 antigens synthesized within 
the antigen-presenting cell, are presented by MHC class I (MHC I) molecules to CD8+ 
cytotoxic T lymphocytes. On the other hand, extracellular or exogenously synthesized 
antigenic determinants are presented on the cell surface of "specialized" or "professional" 
APCs (macrophages, for example) by MHC class II molecules to CD4+ T cells (see, 
generally, Fundamental Immunology, W.E. Paul (ed.), New York: Raven Press, 1984). This 
compartmental segregation of antigen processing routes is important to prevent tissue 
destruction that could otherwise occur during an immune response as a result of shedding of 
neighboring cell MHC I antigens. 

The heat shock protein gp96 chaperones a wide array of peptides, depending upon the 
source from which gp96 is isolated (for review, see Srivastava et al, 1998, Immunity 8: 657- 
665). Tumor-derived gp96 carries tumor-antigenic peptides (Ishii et al, 1999, J. 
Immunology 162:1303-1309); gp96 preparations from virus-infected cells carry viral 
epitopes (Suto and Srivastava, 1995, Science 269:1585-1588; Nieland et al, 1998, Proc. 
Natl. Acad. Sci. USA 95:1800-1805), and gp96 preparations from cells transfected with 
model antigens such as ovalbumin or P-galactosidase are associated with the corresponding 
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epitopes (Arnold et al, 1995, J. Exp. Med.l82:885-889; Breloer et aL % 1998, Eur. J. 
Immunol. 28:1016-1021). The association of gp96 with peptides occurs in vivo (Menoret 
and Srivastava, 1999, Biochem. Biophys. Research Commun. 262:813-818). Gp96-peptide 
complexes, whether isolated from cells (Tamura et al 9 1997, Science 278:117-120), or 
reconstituted in vitro (Blachere et al, 1997, J. Exp. Med. 186:1183-1406) are excellent 
immunogens and have been used extensively to elicit CD8+ T cell responses specific for the 
gp96-chaperoned antigenic peptides. 

The capacity of gp96-peptide complexes to elicit an immune response is dependent 
upon the transfer of the peptide to MHC class I molecules of antigen-presenting cells (Suto 
and Srivastava, 1995, supra). Endogenously synthesized antigens chaperoned by gp96 in the 
endoplasmic reticulum [ER] can prime antigen-specific CD8+ T cells (or MHC I-restricted 
CTLs) in vivo; this priming of CD8+ T cells requires macrophages. However, the process 
whereby exogenously introduced gp96-peptide complexes elicit the antigen-specific CD8+ T 
cell response is not completely understood since there is no established pathway for the 
translocation of extracellular antigens into the class I presentation machinery. Yet antigenic 
peptides of extracellular origin associated with HSPs are somehow salvaged by 
macrophages, channeled into the endogenous pathway, and presented by MHC I molecules 
to be recognized by CD8+ lymphocytes (Suto and Srivastava, 1995, supra; Blachere et al, 
1997, J. Exp. Med. 186:1315-22). 

Several models have been proposed to explain the delivery of extracellular peptides 
for antigen presentation. One proposal, known as the "direct transfer" model, suggests that 
HSP-chaperoned peptides are transferred to MHC I molecules on the cell surface of 
macrophages for presentation to CD8+ T lymphocytes. Another suggestion is that soluble 
extracellular proteins can be trafficked to the cytosol via constitutive macropinocytosis in 
bone marrow-derived macrophages and dendritic cells (Norbury et aL, 1997, Eur. J. 
Immunol. 27:280-288). Yet another proposed mechanism is that HSPs are taken up by the 
MHC class I molecules of the macrophage, which stimulate the appropriate T cells 
(Srivastava et al, 1994, Immunogenetics 39:93-98. Others have suggested that a novel 
intracellular trafficking pathway may be involved for the transport of peptides from the 
extracellular medium into the lumen of ER pay et aL, 1997, Proc. Natl. Acad. Sci. 94:8064- 
8069; Nicchitta, 1998, Curr. Opin. in Immunol. 10:103-109). Further suggestions include 
the involvement of phagocytes which (a) possess an ill-defined pathway to shunt protein 
from the phagosome into the cytosol where it would enter the normal class I pathway; (b) 
digest ingested material in lysosomes and regurgitate peptides for loading on the surface to 
class I molecules (Bevan, 1995, J. Exp. Med. 182:639-41). 

Still others have proposed a receptor-mediated pathway for the delivery of 
extracellular peptides to the cell surface of APS for antigen presentation. In view of the 
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extremely small quantity of gp96-chaperoned antigenic peptides required for immunization 
(Blachere et aL, 1997, supra), and the strict dependence of immunogenicity of gp96-peptide 
complexes on functional antigen presenting cells (APCs) (Udono et aL, 1994, Proc. Natl. 
Acad. Sci. U.S.A. 91 :3077-3081), APCs had been proposed to possess receptors for gp96 
(Srivastava et aL, 1994, Immunogenetics 39:93-98). Preliminary microscopic evidence 
consistent with such receptors has been recently obtained (Binder et aL, 1998, Cell Stress & 
Chaperones 3 (Supp.l):2.; Arnold-SchildeM/., 1999, J. Immunol. 162: 3757-3760; and 
Wassenberg et aL, 1999, J. Cell Sci. 1 :12). One hypothesis is that the mannose receptor is 
used in the uptake of gp96, but no mechanism has been proposed for the non-glycosylated 
HSPs, such as Hsp70 (Ciupitu etaL, 1998, J. Exp. Med., 187:685-691). 

The identification and characterization of specific molecules involved in HSP- 
mediated antigen presentation of peptides, could provide useful reagents and techniques for 
eliciting specific immunity by HSP and HSP-peptide complexes, and for developing novel 
diagnostic and therapeutic methods. 

Citation or discussion of a reference herein shall not be construed as an admission 
that such is prior art to the present invention. 



3. SUMMARY OF THE INVENTION 

20 The present invention provides complexes comprising alpha (2) macroglobulin 

("a2M") and methods for their use in immunotherapy. The invention is based, in part, on the 
Applicant's discovery that a2M directly competes for the binding of heat shock protein gp96 
to the o2M receptor, indicating that a2M and HSPs may bind to a common recognition site 
on the alpha (2) macroglobulin receptor. Thus, because HSPs and a2M have a number of 

25 common functional attributes, such as the ability to bind peptides and the recognition and 
uptake by the alpha (2) macroglobulin receptor, the Applicants have discovered that <x2M can 
be used in the methods described herein for immunotherapy against cancer and infectious 
disease. Alpha-2 -macroglobulin can form complexes with antigens, which are taken up by 
antigen presenting cells ("APCs") via the alpha (2) macroglobulin receptor, also known as 

30 LDL (low-density lipoprotein) Receptor-Related Protein ("LRP") or CD91. Thus, the 
invention provides methods and compositions for using specific a2M-antigenic molecule 
complexes for targeting an immune response against immune disorders, proliferative 
disorders, and infectious diseases. 

The invention encompasses complexes of alpha (2) macroglobulin noncovalently 

35 associated antigenic molecules, recombinant cells that express the complexes of a2M 
associated with antigenic molecules, and antibodies and other molecules that specifically 
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recognize <x2M-antigenic molecule complexes. The invention also provides methods for 
using these compositions in the diagnosis and treatment of immune disorders, proliferative 
disorders, and infectious diseases. 

As used herein, an alpha (2) macroglobulin is associated with an antigenic molecule 
is bound to the antigenic molecule by a covalent or noncovalent bond. A covalent bond can 
be a peptide bond or a thioester linkage, for example. Thus, fusion proteins between alpha 
(2) macroglobulin and an antigenic molecule are within the scope of the invention. 

The invention provides a pharmaceutical composition comprising an amount of a 
molecular complex effective for treatment or prevention of an infectious disease or cancer, 
and a pharmaceutically acceptable carrier, said molecular complex comprising an alpha (2) 
macroglobulin polypeptide noncovalently associated with an antigenic molecule which 
displays the antigenicity of an antigen of an infectious agent or an antigen overexpressed in a 
cancer cell relative to its expression in a noncancerous cell of said cell type. As used herein 
a cell type of a cancer cell, refers to the cell type of the tissue of origin, e.g., breast, lung, 
ovarian. In one embodiment, the antigenic molecule displays the antigenicity of an antigen 
of an infectious agent. In another embodiment, the antigenic molecule displays the 
antigenicity of an antigen overexpressed in a cancer cell relative to its expression in a 
noncancerous cell of said cell type. In another embodiment, the antigenic molecule is a 
tumor specific antigen or a tumor-associated antigen. In another embodiment, the antigenic 
molecule displays the antigenicity of an antigen overexpressed in a cancer cell relative to its 
expression in a noncancerous cell of said cell type. 

In another embodiment, the molecular complex effective for treatment or prevention 
of an infectious disease or cancer, comprising the alpha (2) macroglobulin polypeptide 
noncovalently associated with the antigenic molecule is purified. In particular, tile term 
"purified" molecular complexes refer to complexes which are at least 65% 75%, 80%, 85%, 
90%, 95%, 98% or 100% noncovalent complexes of the alpha (2) macroglobulin polypeptide 
and the antigenic molecule. In another embodiment, the purified molecular complex 
comprising an alpha (2) macroglobulin polypeptide associated with an antigenic molecule of 
an infectious agent or an antigen overexpressed in a cancer cell relative to its expression in a 
noncancerous cell of said cell type. 

The invention further provides a purified population of molecular complexes in 
which at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98%, or 100% of the complexes 
comprise an alpha (2) macroglobulin noncovalently associated with an antigenic molecule. 
Also provided by the invention is a purified population of molecular complexes purified 
from a recombinant cell in which at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98%, or 
100% of the complexes comprise an alpha (2) macroglobulin noncovalently associated with 
an antigenic molecule. 
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The invention also provides a recombinant cell infected with a pathogen and 
transformed with a nucleic acid comprising a nucleotide sequence that (i) is operably linked 
to a promoter, and (ii) encodes an alpha (2) macroglobulin polypeptide, which alpha (2) 
macroglobulin polypeptide associates with an antigenic molecule, when said antigenic 
molecule is present, to form a complex that in sufficient amount is capable of eliciting an 
immune response to the antigenic molecule. The invention provides a recombinant cancer 
cell transformed with a nucleic acid comprising a nucleotide sequence that (i) is operably 
linked to a promoter, and (ii) encodes an alpha (2) macroglobulin polypeptide, which alpha 
(2) macroglobulin polypeptide associates with an antigenic molecule, when said antigen is 
present, to form a complex that in sufficient amount is capable of eliciting an immune 
response to the antigenic molecule. In another embodiment, the invention provides a 
recombinant cell transformed with (i) a first nucleic acid comprising a first nucleotide 
sequence that is operably linked to a first promoter and encodes an alpha (2) macroglobulin 
polypeptide, and (ii) a second nucleic acid comprising a second nucleotide sequence that is 
operably linked to a second promoter and encodes an antigenic molecule, such that the alpha 
(2) macroglobulin polypeptide and the antigenic molecule are expressed within the cell and 
associate with each other to form a complex that in sufficient amount is capable of eliciting 
an immune response to the antigenic molecule. In various embodiments, the recombinant 
cells are human cells. In various embodiments, the pharmaceutical composition comprises a 
recombinant cell and a pharmaceutically acceptable carrier. 

In one embodiment, a method is provided for preparing a complex of an alpha (2) 
macroglobulin polypeptide ndncovalently associated with an antigenic molecule, said alpha 
(2) macroglobulin polypeptide comprising: (a) culturing a cell transformed with a nucleic 
acid comprising a nucleotide sequence encoding the alpha (2) macroglobulin polypeptide, 
under conditions such that said alpha (2) macroglobulin polypeptide is expressed by the cells 
and associates with an antigenic molecule of the cell; and (b) recovering a population of 
complexes of the alpha (2) macroglobulin polypeptide noncovalently associated with 
antigenic molecules from the cells. 

The invention further provides a method for preparing an alpha (2) macroglobulin 
polypeptide noncovalently associated with antigenic molecules derived from one or more 
antigens of an infectious agent, comprising: culturing infected cells, transformed with a 
nucleic acid comprising a nucleotide sequence encoding the alpha (2) macroglobulin 
polypeptide, or fragment, analog, or variant thereof, and operably linked to a promoter, under 
conditions such that the alpha (2) macroglobulin polypeptideis expressed by the cells and 
associates with peptides of the cells; and (b) recovering from the cells a population of 
complexes of the alpha (2) macroglobulin polypeptide noncovalently associated with 
peptides derived from the infectious agent. In one embodiment, the method further 
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comprises purifying the complexes. In another embodiment, the method further comprising 
purifying the complexes by affinity chromatography. 

The invention further provides a method of treating or preventing an infectious 
disease in an individual having an infectious disease comprising administering to the 
individual one or more immunogenic complexes of an alpha (2) microglobulin polypeptide 
noncovalently associated with a first antigenic molecule, wherein the first antigenic molecule 
displays the antigenicity of an antigen of an infectious agent of the infectious disease. In 
another embodiment, the method further comprises, before, concurrently or after 
administration of the immunogenic complex, administering to the individual a composition 
comprising antigen presenting cells sensitized in vitro with a sensitizing amount of a second 
complex of alpha (2) macroglobulin polypeptide noncovalently associated with a second 
antigenic molecule, said second antigenic molecule displaying the antigenicity of a second 
antigenic molecule of said infectious agent. 

The invention further provides a method of treating or preventing an infectious 
disease in a subject having an infectious disease comprising: a) culturing an infected cell 
transformed with a nucleic acid comprising a nucleotide sequence encoding an alpha (2) 
macroglobulin polypeptide, said infected cell displaying the antigenicity of an antigen of an 
infectious agent of the infectious disease, said nucleotide sequence being operably linked to a 
promoter, under conditions such that the alpha (2) macroglobulin polypeptide is expressed by 
the infected cells and associates with antigenic molecules of the cell; b) recovering 
complexes of the alpha (2) macroglobulin polypeptide noncovalently associated with 
antigenic molecules from the infected cell; and c) administering to the subject an amount of 
the recovered complexes effective to treat or prevent the infectious disease. In one 
embodiment, the method further comprises, prior to step (a), the step of obtaining infected 
cells from the subject and transforming the infected cells with the nucleic acid. In another 
embodiment, the method further comprises, prior to step (a), the step of obtaining the 
infected cell from one or more individuals and transforming the infected cells with the 
nucleic acid, said one or more individuals being different from the subject and having the 
same type of infectious disease as the subject. 

The invention further provides a method of treating or preventing an infectious 
disease in a subject having an infectious disease comprising: a) culturing a recombinant cell 
transformed with (i) a first nucleic acid encoding an alpha (2) macroglobulin polypeptide and 
(ii) a second nucleic acid encoding an antigenic molecule displaying the antigenicity of an 
antigen of an infectious agent of the infectious disease; b) recovering complexes of the alpha 
(2) macroglobulin polypeptide noncovalently associated with the antigen; and c) 
administering to the subject an amount of the recovered complexes effective to treat or 
prevent the infectious disease. In various embodiments, the infectious disease is caused by 
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an infectious agent selected from the group consisting of a virus, a bacterium, a fungus, and a 
parasite. 

Also provide by the invention is a method of treating or preventing cancer in an 
individual having a type of cancer or in whom prevention of a type of cancer is desired 
comprising administering to the individual an immunogenic complex of an alpha (2) 
macroglobulin polypeptide noncovalently associated with a first antigenic molecule, wherein 
either (a) the first antigenic molecule displays antigenicity of said type of cancer or a 
metastasis thereof; or (b) the complex is obtained by recovering complexes from said type of 
cancer cells or a metastasis thereof that recombinantly express the alpha (2) macroglobulin 
polypeptide. In one embodiment, this method further comprises, before, concurrently or 
after administration of the immunogenic complex, administering to the individual a 
composition comprising antigen presenting cells sensitized in vitro with a sensitizing amount 
of a second complex of an alpha (2) macroglobulin noncovalently associated with a second 
antigenic molecule, said second antigenic molecule displaying the antigenicity of an antigen 
overexpressed in a cancer cell relative to its expression in a noncancerous cell of said cell 
type, an infectious agent of the infectious disease. In another embodiment, the first 
antigenic molecule is an antigen overexpressed in a cancer cell relative to its expression in a 
noncancerous cell of said cell type. 

In one embodiment, a method is provided for treating or preventing cancer in a 
subject having a type of cancer or in whom prevention of a type of cancer is desired 
comprising: a) culturing a cancer cell of said type transformed with a nucleic acid comprising 
a nucleotide sequence encoding an alpha (2) macroglobulin polypeptide said nucleotide 
sequence being operably linked to a promoter, under conditions such that the alpha (2) 
macroglobulin polypeptide is expressed by the cancer cell and associates with at least one 
antigenic molecule of the cell; b) recovering complexes of the alpha (2) macroglobulin 
polypeptide noncovalently associated with at least one antigenic molecule from the cancer 
cell; and c) administering to the subject an amount of the recovered complexes effective to 
treat or prevent cancer. In one embodiment, the method further comprises, prior to step (a), 
the step of obtaining cancer cells from the subject and transforming the cancer cells with the 
nucleic acid. In another embodiment, the method further comprises, prior to step (a), the step 
of obtaining cancer cells from one or more individuals and transforming the cancer cells with 
the nucleic acid, said one or more individuals being different from the subject and having the 
same type of cancer as the subject. 

The invention further provides a method of treating or preventing cancer in a subject 
having a type of cancer or in whom prevention of a type of cancer is desired comprising: a) 
culturing a recombinant cell transformed with (i) a first nucleic acid encoding an alpha (2) 
macroglobulin polypeptide, and (ii) a second nucleic acid encoding an antigenic molecule 
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displaying the antigenicity of an antigen of a cancer cell; b) recovering complexes of the 
alpha (2) macroglobulin polypeptide noncovalently associated with the antigen; and c) 
administering to the subject an amount of the recovered complexes effective to treat or 
prevent cancer. 

As used herein a "type of cancer" refers to e.g., melanoma, breast cancer, renal 
carcinoma, or a metastasis thereof, where a metastasis refers to the same type of cancer as the 
cell of origin. In various embodiments, the cancer is selected from the group consisting of 
fibrosarcoma, myxosarcoma, liposarcoma, chondrosarcoma, osteogenic sarcoma, chordoma, 
angiosarcoma, endotheliosarcoma, lymphangiosarcoma, lymphangioendotheliosarcoma, 
synovioma, mesothelioma, Swing's tumor, leiomyosarcoma, rhabdomyosarcoma, colon 
carcinoma, pancreatic cancer, breast cancer, ovarian cancer, prostate cancer, squamous cell 
carcinoma, basal cell carcinoma, adenocarcinoma, sweat gland carcinoma, sebaceous gland 
carcinoma, papillary carcinoma, papillary adenocarcinomas, cystadenocarcinoma, medullary 
carcinoma, bronchogenic carcinoma, renal cell carcinoma, hepatoma, bile duct carcinoma, 
choriocarcinoma, seminoma, embryonal carcinoma, Wilms' tumor, cervical cancer, testicular 
tumor, lung carcinoma, small cell lung carcinoma, bladder carcinoma, epithelial carcinoma, 
glioma, astrocytoma, medulloblastoma, craniopharyngioma, ependymoma, pinealoma, 
hemangioblastoma, acoustic neuroma, oligodendroglioma, meningioma* melanoma, 
neuroblastoma, retinoblastoma, leukemia, lymphoma, multiple myeloma, Waldenstrom's 
macroglobulinemia, and heavy chain disease. 

The invention also encompasses a method for treating an autoimmune disorder 
comprising administering to a mammal in need of such treatment an antibody specific for 
alpha (2) macroglobulin. In one embodiment, the antibody is purified. 



FIG. 1 A-D . Identification of an 80 kDa polypeptide as a putative gp96 receptor. A. 
Confocal microscopy of re-presentation-competent RAW264.7 cells stained with gp96-FITC 
and B. with albumm-FTTC. C. SDS-PAGE analysis of detergent extracts of plasma 
membranes from surface bibtinylated RAW264.7 (re-presentation-competent) or P815 cells 
(representation-incompetent) eluted from gp96 or albumin-Sepharose (SA) columns and 
stained with silver stain (top) or avidin-peroxidase (bottom). D. gp96-SASD-I 125 was 
cross-linked to live peritoneal macrophages (MO) or P815 cells, and the cell lysates 
examined by SDS-PAGE and autoradiography. Various components were omitted as 
controls, as indicated. 
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FIG, 2A-B. Anti-p80 antiserum detects an 80 kDa molecule and inhibits re-presentation of 
gp96-chaperoned AHI peptide by macrophage. A, Pre-immune and immune sera ware used 
to probe blots of plasma membrane extracts of RAW264.7, peritoneal macrophages (both 
cell types re-presentation-competent), or P815 cells. B. Re-presentation of gp96-chaperoned 
peptide AHI. Sera were added at the final dilution indicated. The solid cross indicates the 
level of T cell stimulation when the APCs were pulsed directly with the AHi peptide. The 
open cross indicates the corresponding value with unpulsed APCs. 

FIG. 3A-C. Protein microsequencing of the 80 kDa protein. A. Analysis of a single tryptic 
(GALHIYHQR) peptide by tandem- mass spectrometry. All possible b- and y-ion series 
together with identified b-ion series (red) and y-ion series (blue) are shown. B. Collision- 
induced dissociation (CID) spectrum of this peptide is shown. C. Four identified peptides 
from the a2M receptor, peptide mass, and sequence are shown. 

FIG. 4. a2-Macroglobulin inhibits re-presentation of gp96-chaperoned AHI peptide by 
macrophage. The solid cross indicates the level of T cell stimulation when the APCs were 
pulsed directly with the AHI peptide. The open cross indicates the corresponding value with 
impulsed APCs. 

FIG. 5. a2M receptor is a sensor of necrotic cell death due to its ability to detect 
extracellular gp96. Conversely, receptors (psR) for phosphatidyl serine (ps) detect apoptotic 
cell death. 



FIG. 6A-B. A. The mouse a2MR cDNA (SEQ ID NO:l) and predicted open reading frame 
of murine <x2MR protein (Genbank accession no. CAA47817). B. The murine a2M protein 
(SEQ ID NO:2), with residues identified by microsequencing an 80 kDa, gp96-interacting 
fragment of the receptor highlighted in bold. 



FIG. 7A-B. A Amino acid sequence of a2M (SEQ ID NO: 3). B. Nucleotide sequence 
of a2M (SEQ ID NO: 4). The 138 amino acid sequence (SEQ ID NO.: 5) of the receptor 
binding domain from a2M is underlined. 
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5. DETAILED DESCRIPTION OF THE INVENTION 



The present invention provides compositions and methods for alpha (2) 
macroglobulin ("a2M") vaccines for use in immunotherapy. The invention is based, in part, 
on the Applicant's discovery that a2M blocks uptake of heat shock proteins by antigen 
presenting cells. In particular, the invention provides complexes of a2M associated with 
antigenic molecules, which are recognized by the alpha (2) macroglobulin receptor on 
antigen presenting cells ("APCs"), and are presented by such cells to the immune system. 
Thus, the invention provides methods and compositions for using specific a2M-antigenic 
molecule complexes for targeting an immune response against immune disorders, 
proliferative disorders, and infectious diseases. 

The human plasma protein alpha (2) macroglobulin is a 720 kDa homotetrameric 
proteinase inhibitor primarily known as proteinase inhibitor and plasma and inflammatory 
fluid proteinase scavenger molecule (for review see Chu and Pizzo, 1994, Lab. Invest. 
71 :792). During proteolytic activation of a2M, non-proteolytic ligands can become 
incorporated, covalently and noncovalently, to the activated thioesters (see Osada et aL, 
1987, Biochem. Biophys. Res. Comm. 146:26-31; Osada et aL, 1988, Biochem. Biophys. 
Res. Comm. 150:883-889; Chu and Pizzo, 1993, J. Immunology 150: 48-58; Chu et aL, 
1994, 152:1538-1545; Mitsudae/ aL, 1993, Biochem. Biophys. Res. Comm. 191:1326- 
1331). As described herein, when complexes formed between a2M and an antigenic 
molecule having the antigenicity of a cancer cell antigen or of a pathogen, such a2M 
-antigenic molecule complexes can be used to stimulate a cytotoxic T cell response directed 
against the a2M incorporated antigen. Such complexes can be used as immunotherapeutic 
agents to treat cancer and infectious diseases. 

Described in detailed hereinbelow are methods and compositions for use in 
preparation and delivery of such therapeutic a2M-antigenic molecule complexes. The 
invention encompasses complexes of alpha (2) macroglobulin associated antigenic 
molecules, antigenic cells that express the a2M, and antibodies and other molecules that 
specifically recognize a2M-antigenic molecule complexes. The invention also relates to 
methods for using these compositions in the diagnosis and treatment of immune disorders, 
proliferative disorders, and infectious diseases. 

5.1 COMPOSITIONS OF THE INVENTION 

The present invention provides compositions that can be used in immunotherapy 
against proliferative disorders, infectious diseases, and immune disorders. Such 
compositions include antibodies that specifically recognize a2M complexes, isolated 
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antigenic cells that express a2M complexes, and recombinant cells that contain recombinant 
<x2M and sequences encoding antigenic molecules. 

It is contemplated that the definition of a2M, as used herein, embraces other 
polypeptide fragments, analogs, and variants of a2M having at least 35% to 55%, preferably 
55% to 75%, and most preferably 75% to 85% amino acid identity with a2M, and is capable 
of forming a complex with an antigenic molecule, which complex is capable of being taken 
up by an antigen presenting cell and eliciting an immune response against the antigenic 
molecule. The determination of percent identity between two sequences can also be 
accomplished using a mathematical algorithm. A preferred, non-limiting example of a 
mathematical algorithm utilized for the comparison of two sequences is the algorithm of 
Karlin and Altschui, 1990, Proc. Natl. Acad. Sci. USA 87:2264-2268, modified as in Karlin 
and Altschui, 1993, Proc. Natl. Acad. Sci. USA 90:5873-5877. Such an algorithm is 
incorporated into the NBLAST and XBLAST programs of Altschui, et al, 1990, J. Mol. 
Biol. 215:403-410. BLAST nucleotide searches can be performed with the NBLAST 
program, score = 100, wordlength = 12 to obtain nucleotide sequences homologous to a 
nucleic acid molecules of the invention. BLAST protein searches can be performed with the 
XBLAST program, score = 50, wordlength = 3 to obtain amino acid sequences homologous 
to a protein molecules of the invention. To obtain gapped alignments for comparison 
purposes, Gapped BLAST can be utilized as described in Altschui et al, 1997, Nucleic Acids 
Res.25:3389-3402. Alternatively, PSI-Blast can be used to perform an iterated search which 
detects distant relationships between molecules (Altschui et al , 1 997, supra). When 
utilizing BLAST, Gapped BLAST, and PSI-Blast programs, the default parameters of the 
respective programs (e.g., XBLAST and NBLAST) can be used (see 
http://www.ncbi.nlm.nih.gov). Another preferred, non-limiting example of a mathematical 
algorithm utilized for the comparison of sequences is the algorithm of Myers and Miller, 
1988, CABIOS 4:11-17. Such an algorithm is incorporated into the ALIGN program 
(version 2.0) which is part of the GCG sequence alignment software package. When 
utilizing the ALIGN program for comparing amino acid sequences, a PAM120 weight 
residue table, a gap length penalty of 12, and a gap penalty of 4 can be used. 

The immunogenic a2M-antigenic molecule complexes of the invention may include 
any complex containing an a2M and an antigenic peptide that is capable of inducing an 
immune response in a mammal. 

ct2M and/or antigenic molecules can be purified from natural sources, chemically 
synthesized, or recombinantly produced. 
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5.1.1. a2M POLYPEPTIDES 

The alpha (2) macroglobulin complex of the invention is comprised of an alpha (2) 
macroglobulin polypeptide associated with an antigenic peptide. Alpha (2) macroglobulin 
polypeptides may be produced by recombinant DNA techniques, synthetic methods, or by 
enzymatic or chemical cleavage of native a2M polypeptides. Described herein are methods 
for producing such a2M polypeptides.. 



In various aspects, the invention relates to compositions comprising amino acid 
sequences of <x2M, and fragments, derivatives, analogs, and variants thereof. Nucleic acids 
encoding a2M are provided, as well as nucleic acids complementary to and capable of 
hybridizing to such nucleic acids. 

Any eukaryotic cell may serve as the nucleic acid source for obtaining the coding 
region of an a2M gene. Nucleic acid sequences encoding <x2M can be isolated from 
vertebrate, mammalian, as well as primate sources, including humans. 

Amino acid sequences and nucleotide sequences of naturally occurring a2M 
polypeptides are generally available in sequence databases, such as GenBank. Non-limiting 
examples of a2M sequences that can be used for preparation of the a2M polypeptides of the 
invention are as follows: Genbank Accession Nos. Ml 13 13, P01023, AAA51551; Kane* 
al., 1985, Proc. Nat. Acad. Sci. 82: 2282-2286. Due to the degeneracy of the genetic code, 
the term "a2M gene", as used herein, refers not only to the naturally occurring nucleotide 
sequence but also encompasses all the other degenerate DNA sequences that encode an a2M 
polypeptide. Computer programs, such as Entrez, can be used to browse the database, and 
retrieve any amino acid sequence and genetic sequence data of interest by accession number. 
These databases can also be searched to identify sequences with various degrees of 
similarities to a query sequence using programs, such as FASTA and BLAST, which rank the 
similar sequences by alignment scores and statistics. BLAST nucleotide searches can be 
performed with the NBLAST program, score = 100, wordlength = 12 to obtain nucleotide 
sequences homologous to a nucleic acid molecules of the invention. BLAST protein 
searches can be performed with the XBLAST program, score = 50, wordlength = 3 to obtain 
amino acid sequences homologous to a protein molecules of the invention. To obtain gapped 
alignments for comparison purposes, Gapped BLAST can be utilized as described in 
Altschul et al, 1997, Nucleic Acids Res.25:3389-3402. Alternatively, PSI-Blast can be used 
to perform an iterated search which detects distant relationships between molecules (Altschul 
et ai 9 1997, supra). When utilizing BLAST, Gapped BLAST, and PSI-Blast programs, the 
default parameters of the respective programs (e.g., XBLAST and NBLAST) can be used 
(see http://www.ncbi.nlm.nih.gov). 



5.1.1.1 ISOLATION OF ot2M GENE SEQUENCES 
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The DNA may be obtained by standard procedures known in the art by DNA 
amplification or molecular cloning directly from a tissue, cell culture, or cloned DNA (e.g., a 
DNA "library"). Clones derived from genomic DNA may contain regulatory and intron 
DNA regions in addition to coding regions; clones derived from cDNA will contain only 
exon sequences. Whatever the source, the a2M gene should be cloned into a suitable vector 
for propagation of the gene. 

In a preferred embodiment, DNA can be amplified from genomic or cDNA by 
polymerase chain reaction (PCR) amplification using primers designed from the known 
sequence of a related or homologous a2M. PCR is used to amplify the desired sequence in 
DNA clone or a genomic or cDNA library, prior to selection. PCR can be carried out, e.g., 
by use of a thermal cycler and Taq polymerase (sold under the trademark GENE AMP). The 
DNA being amplified can include cDNA or genomic DNA from any species. 
Oligonucleotide primers representing known nucleic acid sequences of related HSPs can be 
used as primers in PCR. In a preferred aspect, the oligonucleotide primers represent at least 
part of the a2M gene that is highly conserved between a2M genes of different species. One 
can choose to synthesize several different degenerate primers, for use in the PCR reactions. 
It is also possible to vary the stringency of hybridization conditions used in priming the PCR 
reactions, to allow for greater or lesser degrees of nucleotide sequence similarity between the 
known a2M nucleotide sequence and the nucleic acid homolog being isolated. For cross 
species hybridization, low stringency conditions are preferred. For same species 
hybridization, moderately stringent conditions are preferred. After successful amplification, 
the sequence encoding an o2M may be cloned and sequenced. If the size of the coding 
region of the <x2M gene being amplified is too large to be amplified in a single PCR, several 
PCR covering the entire gene, preferably with overlapping regions, may be carried out, and 
the products of the PCR ligated together to form the entire coding sequence. Alternatively, if 
a segment of an a2M gene is amplified, that segment may be cloned, and utilized as a probe 
to isolate a complete cDNA or genomic clone. 

In another embodiment, for the molecular cloning of an a2M gene from genomic 
DNA, DNA fragments are generated to form a genomic library. Since some of the sequences 
encoding related a2Ms are available and can be purified and labeled, the cloned DNA 
fragments in the genomic DNA library may be screened by nucleic acid hybridization to the 
labeled probe (Benton and Davis, 1977, Science 196:180; Grunstein and Hogness, 1975, 
Proc. Natl. Acad. Sci. U.S.A. 72:3961). Those DNA fragments with substantial homology to 
the probe will hybridize. It is also possible to identify the appropriate fragment by restriction 
enzyme digestion(s) and comparison of fragment sizes with those expected according to a 
known restriction map if such is available. 
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Alternatives to isolating the a2M genomic DNA include, but are not limited to, 
chemically synthesizing the gene sequence itself from a known sequence or making cDNA to 
the mRNA which encodes a2M. For example, RNA for cDNA cloning of the ct2M gene can 
be isolated from cells which express a2M. A cDNA library may be generated by methods 
known in the art and screened by methods, such as those disclosed for screening a genomic 
DNA library. If an antibody to a2M is available, a2M may be identified by binding of 
labeled antibody to the putatively a2M synthesizing clones. 

Other specific embodiments for the cloning of a nucleotide sequence encoding an 
<x2M, are presented as examples but not by way of limitation, as follows: 

In a specific embodiment, nucleotide sequences encoding a2M proteins within a 
family can be identified and obtained by hybridization with a probe comprising nucleotide 
sequence encoding a2M under conditions of low to medium stringency. 

By way of example and not limitation, procedures using such conditions of low 
stringency are as follows (see also Shilo and Weinberg, 1981, Proc. Natl. Acad. Sci. USA ' 
78:6789-6792). Filters containing DNA are pretreated for 6 h at 40°C in a solution 
containing 35% formamide, 5X SSC, 50 mM Tris-HCl (pH 7.5), 5 mM EDTA, 0.1% PVP, 
0.1% Ficoll, 1% BSA, and 500 \ig/ml denatured salmon sperm DNA. Hybridizations are 
carried out in the same solution with the following modifications: 0.02% PVP, 0.02% Ficoll, 
0.2% BSA, 100 fig/ml salmon sperm DNA, 10% (wt/vol) dextran sulfate, and 5-20 X 10 6 
cpm 32 P-labeled probe is used. Filters are incubated in hybridization mixture for 18-20 h at 
40°C, and then washed for 1 .5 h at 55 °C in a solution containing 2X SSC, 25 mM Tris-HCl 
(pH 7.4), 5 mM EDTA, and 0.1% SDS. The wash solution is replaced with fresh solution 
and incubated an additional 1 .5 h at 60° C. Filters are blotted dry and exposed for 
autoradiography. If necessary, filters are washed for a third time at 65-68 °C and reexposed 
to film. Other conditions of low stringency which may be used are well known in the art 
(e.g., as employed for cross-species hybridizations). 

An <x2M gene fragment can be inserted into an appropriate cloning vector and 
introduced into host cells so that many copies of the gene sequence are generated. A large 
number of vector-host systems known in the art may be used such as, but not limited to, 
bacteriophages such as lambda derivatives, or plasmids such as pBR322 or pUC plasmid 
derivatives or the Bluescript vector (Stratagene). 

Any technique for mutagenesis known in the art can be used to modify individual 
nucleotides in a DNA sequence, for purpose of making amino acid substitutions) in the 
expressed peptide sequence, or for creating/deleting restriction sites to facilitate further 
manipulations. Such techniques include but are not limited to, chemical mutagenesis, in 
vitro site-directed mutagenesis (Hutchinson et al, 1978, J. Biol. Chem 253:6551), 
oligonucleotide-directed mutagenesis (Smith, 1985, Ann. Rev. Genet. 19:423-463; Hill et al. 9 
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1987, Methods Enzymol. 155:558-568), PCR-based overlap extension (Ho et aU 1989, Gene 
77:51-59), PCR-based megaprimer mutagenesis (Sarkar et al, 1990, Biotechniques, 8:404- 
407), etc. Modifications can be confirmed by double stranded dideoxy DNA sequencing. 

The polymerase chain reaction (PCR) is commonly used for obtaining genes or gene 
fragments of interest For example, a nucleotide sequence encoding a2M polypeptide of any 
desired length can be generated using PCR primers that flank the nucleotide sequence 
encoding a2M, or the peptide-binding domain thereof. Alternatively, an a2M gene sequence 
can be cleaved at appropriate sites with restriction endonuclease(s) if such sites are available, 
releasing a fragment of DNA encoding a2M, or the peptide-binding domain thereof. If 
convenient restriction sites are not available, they may be created in the appropriate positions 
by site-directed mutagenesis and/or DNA amplification methods known in the art (see, for 
example, Shankarappa et aL, 1992, PCR Method Appl. 1:277-278). The DNA fragment that 
encodes a2M, or the peptide-binding domain thereof, is then isolated, and ligated into an 
appropriate expression vector, care being taken to ensure that the proper translation reading 
frame is maintained 

Alpha (2) macroglobulin polypeptides of the invention may be expressed as fusion 
proteins to facilitate recovery and purification from the cells in which they are expressed. 
For example, an a2M polypeptide may contain a signal sequence leader peptide to direct its 
translocation across the ER membrane for secretion into culture medium. Further, an a2M 
polypeptide may contain an affinity label, such as a affinity label, fused to any portion of the 
a2M polypeptide not involved in binding antigenic peptide, such as for example, the 
carboxyl terminal. The affinity label can be used to facilitate purification of the protein, by 
binding to an affinity partner molecule. 

Various methods for production of such fusion proteins are well known in the art. 
The manipulations which result in their production can occur at the gene or protein level, 
preferably at the gene level. For example, the cloned coding region of an a2M polypeptide 
may be modified by any of numerous recombinant DNA methods known in the art 
(Sambrook et aL, 1990, Molecular Cloning, A Laboratory Manual, 2d ed., Cold Spring 
Harbor Laboratory, Cold Spring Harbor, New York; Ausubel et aL, in Chapter 8 of Current 
Protocols in Molecular Biology, Greene Publishing Associates and Wiley Interscience, New 
York). It will be apparent from the following discussion that substitutions, deletions, 
insertions, or any combination thereof are introduced or combined to arrive at a final 
nucleotide sequence encoding an a2M polypeptide. 

In various embodiments, fusion proteins comprising the a2M polypeptide may be 
made using recombinant DNA techniques. For example, a recombinant gene encoding an 
a2M polypeptide may be constructed by introducing an a2M gene fragment in the proper 
reading frame into a vector containing the sequence of an affinity label, such that the a2M 
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polypeptide is expressed as a peptide-tagged fusion protein. Affinity labels, which may be 
recognized by specific binding partners, may be used for affinity purification of the a2M 
polypeptide. 

In a preferred embodiment, the affinity label is fused at its amino terminal to the 
carboxyl terminal of a2M. The precise site at which the fusion is made in the carboxyl 
terminal is not critical. The optimal site can be determined by routine experimentation. 

A variety of affinity labels known in the art may be used, such as, but not limited to, 
the immunoglobulin constant regions, polyhistidine sequence (Petty, 1996, Metal-chelate 
affinity chromatography, in Current Protocols in Molecular Biology, Vol. 2, Ed. Ausubel et 
al 9 Greene Publish. Assoc. & Wiley Interscience), glutathione S-transferase (GST; Smith, 
1993, Methods Mol. Cell Bio. 4:220-229), the E. coli maltose binding protein (Guan et al 9 
1987, Gene 67:21-30), and various cellulose binding domains (U.S. Patent Kos. 5,496,934; 
5,202,247; 5,137,819; Tommee* a/., 1994, Protein Eng. 7:117-123), etc. Other affinity 
labels may impart fluorescent properties to an a2M polypeptide, e.g., portions of green 
fluorescent protein and the like. Other possible affinity labels are short amino acid sequences 
to which monoclonal antibodies are available, such as but not limited to the following well 
known examples, the FLAG epitope, the myc epitope at amino acids 408-439, the influenza 
virus hemagglutinin (HA) epitope. Other affinity labels are recognized by specific binding 
partners and thus facilitate isolation by affinity binding to the binding partner which can be 
immobilized onto a solid support. Some affinity labels may afford the a2M polypeptide 
novel structural properties, such as the ability to form multimers. Dimerization of an a2M 
polypeptide with a bound peptide may increase avidity of interaction between the <x2M 
polypeptide and its partner in the course of antigen presentation. These affinity labels are 
usually derived from proteins that normally exist as homopolymers. Affinity labels such as 
the extracellular domains of CD8 (Shiue et aL, 1988, J. Exp. Med. 168:1993-2005), or CD28 
(Lee et al 9 1990, J. Immunol. 145:344-352), or portions of the immunoglobulin molecule 
containing sites for interchain disulfide bonds, could lead to the formation of multimers. As 
will be appreciated by those skilled in the art, many methods can be used to obtain the coding 
region of the above-mentioned affinity labels, including but not limited to, DNA cloning, 
DNA amplification, and synthetic methods. Some of the affinity labels and reagents for their 
detection and isolation are available commercially. 

A preferred affinity label is a non-variable portion of the immunoglobulin molecule. 
Typically, such portions comprise at least a functionally operative CH2 and CH3 domain of 
the constant region of an immunoglobulin heavy chain. Fusions are also made using the 
carboxyl terminus of the Fc portion of a constant domain, or a region immediately amino- 
terminal to the CHI of the heavy or light chain. Suitable immunoglobulin-based affinity 
label may be obtained from IgG-1, -2, -3, or -4 subtypes, IgA, IgE, IgD, or IgM, but 
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preferably IgGl. Preferably, a human immunoglobulin is used when the <x2M polypeptide is 
intended for in vivo use for humans. Many DNA encoding immunoglobulin light or heavy 
chain constant regions is known or readily available from cDNA libraries. See, for example, 
Adams et al, Biochemistry, 1980, 19:271 1-2719; Gough et al, 1980, Biochemistry, 
19:2702-2710; Dolby et al, 1980, Proa Natl. Acad. Sci. U.S.A., 77:6027-6031; Bice et al, 
1982, Proc. Natl. Acad. Sci. U.S.A., 79:7862-7865; Falkner etal, 1982, Nature, 298:286- 
288; and Morrison etal, 1984, Ann. Rev. Immunol, 2:239-256. Because many 
immunological reagents and labeling systems are available for the detection of 
immunoglobulins, the <x2M polypeptide-Ig fusion protein can readily be detected and 
quantified by a variety of immunological techniques known in the art, such as the use of 
enzyme-linked immunosorbent assay (ELISA), immunoprecipitation, fluorescence activated 
cell sorting (FACS), etc. Similarly, if the affinity label is an epitope with readily available 
antibodies, such reagents can be used with the techniques mentioned above to detect, 
quantitate, and isolate the a2M polypeptide containing the affinity label. In many instances, 
there is no need to develop specific antibodies to the a2M polypeptide. 

A particularly preferred embodiment is a fusion of an a2M polypeptide to the hinge, 
the CH2 and CH3 domains of human immunoglobulin G-l (IgG-1; see Bowen et a/., 1996, J. 
Immunol. 156:442-49). This hinge region contains three cysteine residues which are 
normally involved in disulfide bonding with other cysteines in the Ig molecule. Since none 
of the cysteines are required for the peptide to function as a tag, one or more of these cysteine 
residues may optionally be substituted by another amino acid residue, such as for example, 
serine. 

Various leader sequences known in the art can be used for the efficient secretion of 
a2M polypeptide from bacterial and mammalian cells (von Heijne, 1985, J. Mol. Biol. 
184:99-105). Leader peptides are selected based on the intended host cell, and may include 
bacterial, yeast, viral, animal, and mammalian sequences. For example, the herpes virus 
glycoprotein D leader peptide is suitable for use in a variety of mammalian cells. A preferred 
leader peptide for use in mammalian cells can be obtained from the V-J2-C region of the 
mouse immunoglobulin kappa chain (Bernard et al, 1981, Proc. Natl. Acad. Sci. 78:5812- 
5816). Preferred leader sequences for targeting <x2M polypeptide expression in bacterial cells 
include, but are not limited to, the leader sequences of the E.coli proteins OmpA (Hobom et 
al., 1995, Dev. Biol. Stand, 84:255-262), Pho A (Oka et al, 1985, Proc. Natl. Acad. Sci 
82:7212-16), OmpT (Johnson et al, 1996, Protein Expression 7:104-1 13), LamB and OmpF 
(Hoffinan & Wright, 1985, Proc. Natl. Acad. Sci. USA 82:5107-51 11), P-lactamase 
(Kadonaga et al, 1984, J. Biol. Chem. 259:2149-54), enterotoxins (Morioka-Fujimoto et al, 
1991, J. Biol. Chem. 266:1728-32), and the Staphylococcus aureus protein A (Abrahmsen et 
al, 1986, Nucleic Acids Res. 14:7487-7500), and the 5. subtilis endoglucanase (Lo et al, 
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Appl. Environ. Microbiol 54:2287-2292), as well as artificial and synthetic signal sequences 
(Maclntyre et al, 1990, Mol. Gen. Genet 221:466-74; Kaiser et aL, 1987, Science, 235:312- 
317). 

DNA sequences encoding a desired affinity label or leader peptide, which may be 
readily obtained from libraries, produced synthetically, or may be available from commercial 
suppliers, are suitable for the practice of this invention. Such methods are well known in the 
art. 
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5.1.1.2 RECOMBINANT EXPRESSION 

In various embodiments of the invention, sequences encoding an a2M polypeptide 
are inserted into an expression vector for propagation and expression in recombinant cells. 

An expression construct, as used herein, refers to a nucleotide sequence encoding an 
a2M polypeptide operably associated with one or more regulatory regions which allows 
expression of the a2M polypeptide in an appropriate host cell. "Operably-associated" refers 
to an association in which the regulatory regions and the a2M polypeptide sequence to be 
expressed are joined and positioned in such a way as to permit transcription, and ultimately, 
translation. 

Vectors based on E. coli are the most popular and versatile systems for high level 
expression of foreign proteins (Makrides, 1996, Microbiol Rev, 60:512-538). Non-limiting 
examples of regulatory regions that can be used for expression in E. coli may include but not 
limited to lac, trp, Ipp.phoA, recA, tac, XP U and phage T3 and T7 promoters (Makrides, 
1996, Microbiol Rev, 60:512-538). Non-limiting examples of prokaryotic expression vectors 
may include the Xgt vector series such as Agtl 1 (Huynh et al. 9 1984 in "DNA Cloning 
Techniques", Vol. I: A Practical Approach (D. Glover, ed.), pp. 49-78, IRL Press, Oxford), 
and the pET vector series (Studier et al, 1990, Methods Enzymol., 185:60-89). However, a 
potential drawback of a prokaryotic host-vector system is the inability to perform many of 
the post-translational processing events of mammalian cells. Thus, an eukaiyotic host- vector 
system is preferred, a mammalian host- vector system is more preferred, and a human host- 
vector system is the most preferred. 

The regulatory regions necessary for transcription of the a2M polypeptide can be 
provided by the expression vector. A translation initiation codon (ATG) may also be 
provided to express a nucleotide sequence encoding an <x2M polypeptide that lacks an 
initiation codon. In a compatible host-construct system, cellular proteins required for 
transcription, such as RNA polymerase and transcription factors, will bind to the regulatory 
regions on the expression construct to effect transcription of the a2M polypeptide sequence 
in the host organism. The precise nature of the regulatory regions needed for gene 
expression may vary from host cell to host cell. Generally, a promoter is required which is 
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capable of binding RNA polymerase to initiate the transcription of an operably-associated 
nucleic acid sequence. Such regulatory regions may include those 5-non-coding sequences 
involved with initiation of transcription and translation, such as the TATA box, the cap site, 
a CAAT box, and the like. The non-coding region 3* to the coding sequence may contain 
transcriptional termination regulatory sequences, such as terminators and polyadenylation 
sites. 

Both constitutive and inducible regulatory regions may be used for expression of the 
a2M polypeptide. It may be desirable to use inducible promoters when the conditions 
optimal for growth of the recombinant cells and the conditions for high level expression of 
the a2M polypeptide are different. Examples of useful regulatory regions are provided in 
the next section below. 

For expression of a2M polypeptides in mammalian host cells, a variety of regulatory 
regions can be used, for example, the S V40 early and late promoters, the cytomegalovirus 
(CMV) immediate early promoter, and the Rous sarcoma virus long terminal repeat (RSV- 
LTR) promoter. Inducible promoters that may be useful in mammalian cells include but are 
not limited to those associated with the metallothionein II gene, mouse mammary tumor 
virus glucocorticoid responsive long terminal repeats (MMTV-LTR), the p-interferon gene, 
and the a2M70 gene (Williams et aL, 1989, Cancer Res. 49:2735-42 ; Taylor et aL, 1990, 
Mol. Cell Biol., 10:165-75). 

The following animal regulatory regions, which exhibit tissue specificity and have 
been utilized in transgenic animals, can also be used in tumor cells of a particular tissue type: 
elastase I gene control region which is active in pancreatic acinar cells (Swift et aL, 1984, 
Cell 38:639-646; Omitz et aL, 1986, Cold Spring Harbor Symp. Quant. Biol. 50:399-409; 
MacDonald, 1987, Hepatology 7:425-515); insulin gene control region which is active in 
pancreatic beta cells (Hanahan, 1985, Nature 315:1 15-122), immunoglobulin gene control 
region which is active in lymphoid cells (Grosschedl et aL, 1984, Cell 38:647-658; Adames 
et aL, 1985, Nature 318:533-538; Alexander et aL, 1987, Mol. Cell. Biol. 7:1436-1444), 
mouse mammary tumor virus control region which is active in testicular, breast, lymphoid 
and mast cells (Leder et aL, 1986, Cell 45:485-495), albumin gene control region which is 
active in liver (Pinkert et aL, 1987, Genes and Devel 1:268-276), alpha-fetoprotein gene 
control region which is active in liver (Krumlauf et aL, 1985, Mol. Cell. Biol. 5:1639-1648; 
Hammer et aL, 1987, Science 235:53-58; alpha 1-antitrypsin gene control region which is 
active in the liver (Kelsey et aL, 1987, Genes and Devel. 1:161-171), beta-globin gene 
control region which is active in myeloid cells (Mogram et aL, 1985, Nature 315:338-340; 
Kollias et aL, 1986, Cell 46:89-94; myelin basic protein gene control region which is active 
in oligodendrocyte cells in the brain (Readhead et aL, 1987, Cell 48:703-712); myosin light 
chain-2 gene control region which is active in skeletal muscle (Sani, 1985, Nature 314:283- 
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286), and gonadotropic releasing hormone gene control region which is active in the 
hypothalamus (Mason et at., 1986, Science 234:1372-1378). 

The efficiency of expression of the <x2M polypeptide in a host cell may be enhanced 
by the inclusion of appropriate transcription enhancer elements in the expression vector, such 
as those found in SV40 virus^ Hepatitis B virus, cytomegalovirus, immunoglobulin genes, 
metallothionein, JJ-actin (see Bittner et aL 9 1987, Methods in Enzymol. 153:516-544; 
Gorman, 1990, Curr. Op. in Biotechnol. 1:36-47). 

The expression vector may also contain sequences that permit maintenance and 
replication of the vector in more than one type of host cell, or integration of the vector into 
the host chromosome. Such sequences may include but are not limited to replication origins, 
autonomously replicating sequences (ARS), centromere DNA, and telomere DNA. It may 
also be advantageous to use shuttle vectors that can be replicated and maintained in at least 
two types of host cells. 

In addition, the expression vector may contain selectable or screenable marker genes 
for initially isolating or identifying host cells that contain DNA encoding an a2M 
polypeptide. For long term, high yield production of a2M polypeptide-antigenic molecule 
complexes, stable expression in mammalian cells is preferred. A number of selection 
systems may be used for mammalian cells, including, but not limited, to the Herpes simplex 
virus thymidine kinase (Wigler et aL 9 1977, Cell 1 1:223), hypoxanthine-guanine 
phosphoribosyltransferase (Szybalski and Szybalski, 1962, Proc. Natl. Acad. Sci. USA 
48:2026), and adenine phosphoribosyltransferase (Lowy et al. 9 1980, Cell 22:817) genes can 
be employed in tk\ hgprf or aprf cells, respectively. Also, antimetabolite resistance can be 
used as the basis of selection for dihydrofolate reductase (dhfr), which confers resistance to 
methotrexate (Wigler etal, 1980, Natl. Acad. Sci. USA 77:3567; O'Hare etaL, 1981, Proc. 
Natl. Acad. Sci. USA 78:1527); gpt, which confers resistance to mycophenolic acid 
(Mulligan & Berg, 1981, Proc. Natl. Acad. Sci. USA 78:2072); neomycin 
phosphotransferase (neo), which confers resistance to the aminoglycoside G-418 (Colberre- 
Garapin et al, 1981, J. Mol. Biol. 150:1); and hygromycin phosphotransferase (hyg), which 
confers resistance to hygromycin (Santerre et al 9 1984, Gene 30:147). Other selectable 
markers, such as but not limited to histidinol and Zeocin™ can also be used. 

In order to insert the a2M polypeptide DNA sequence into the cloning site of a 
vector, DNA sequences with regulatory functions, such as promoters, must be attached to 
DNA sequences encoding the a2M peptide-binding region. To do this, linkers or adapters 
providing the appropriate compatible restriction sites may be ligated to the ends of cDNA or 
synthetic DNA encoding an a2M polypeptide, by techniques well known in the art (Wu et 
al, 1987, Methods in En2ymol 1 52:343-349). Cleavage with a restriction enzyme can be 
followed by modification to create blunt ends by digesting back or filling in single-stranded 
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DNA termini before ligation. Alternatively, a desired restriction enzyme site can be 
introduced into a fragment of DNA by amplification of the DNA by use of PGR with primers 
containing the desired restriction enzyme site. 

An expression construct comprising an a2M polypeptide sequence operably 
associated with regulatory regions can be directly introduced into appropriate host cells for 
expression and production of a2M polypeptide-antigenic molecule complexes without 
further cloning (see, for example, U.S. Patent No. 5,580,859). The expression constructs 
may also contain DNA sequences that facilitate integration of the a2M polypeptide sequence 
into the genome of the host cell, e.g., via homologous recombination. In this instance, it is 
not necessary to employ an expression vector comprising a replication origin suitable for 
appropriate host cells in order to propagate and express the a2M polypeptide in the host 
cells. 

Expression constructs containing cloned nucleotide sequence encoding a2M 
polypeptides can be introduced into the host cell by a variety of techniques known in the art, 
including but not limited to, for prokaryotic cells, bacterial transformation (Hanahan, 1985, 
in DNA Cloning, A Practical Approach, 1:109-136), and for eukaryotic cells, calcium 
phosphate mediated transfection (Wigler et al 9 1977, Cell 1 1 :223-232), liposome-mediated 
transfection (Schaefer-Ridder et aL, 1982, Science 215:166-168), electroporation (Wolffs 
aL, 1987, Proc Natl Acad Sci 84:3344), and microinjection (Cappechi, 1980, Cell 22:479- 
488). Co-expression of an a2M polypeptide and an antigenic molecule in the same host cell 
can be achieved by essentially the same methods. 

For long term, high yield production of properly processed d2M polypeptides or <x2M 
polypeptide-antigenic molecule complexes, stable expression in mammalian cells is 
preferred. Cell lines that stably express a2M polypeptides or <x2M polypeptide-antigenic 
molecule complexes may be engineered by using a vector that contains a selectable marker. 
By way of example but not limitation, following the introduction of the expression 
constructs, engineered cells may be allowed to grow for 1-2 days in an enriched media, and 
then are switched to a selective media. The selectable marker in the expression construct 
confers resistance to the selection and optimally allows cells to stably integrate the 
expression construct into their chromosomes and to grow in culture and to be expanded into 
cell lines. Such cells can be cultured for a long period of time while ct2M polypeptide is 
expressed continuously. 

The recombinant cells may be cultured under standard conditions of temperature, 
incubation time, optical density, and media composition. Alternatively, a recombinant 
antigenic cells may be cultured under conditions emulating the nutritional and physiological 
requirements of the cancer cell or infected cell. However, conditions for growth of 
recombinant cells may be different from those for expression of a2M polypeptides and 
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antigenic proteins. Modified culture conditions and media may also be used to enhance 
production of a2M-antigenic molecule complexes. Any techniques known in the art may be 
applied to establish the optimal conditions for producing <x2M polypeptide or <x2M 
polypeptide-antigenic molecule complexes. 

5.1.1.3 PURIFICATION METHODS FOR RECOMBINANT a2M 
POLYPEPTIDES 

Generally, the a2M polypeptides of the invention can be recovered and purified from 
recombinant cell cultures by known methods, including ammonium sulfate precipitation, 
acid extraction, anion or cation exchange chromatography, phosphocellulose 
chromatography, immunoaffinity chromatography, hydroxyapatite chromatography, and 
lectin chromatography. 

The invention provides methods for purification of recombinant a2M polypeptides by 
affinity purification, based on the properties of the affinity label present on the a2M 
polypeptide. One approach is based on specific molecuMr interactions between a tag and its 
binding partner. The other approach relies on the immunospecific binding of an antibody to 
an epitope present on the tag. The principle of affinity chromatography well known in the art 
is generally applicable to both of these approaches. 

Described below are several methods based on specific molecular interactions of a 
tag and its binding partner. 

/A) 

A method that is generally applicable to purifying recombinant a2Ms that are fused to 
the constant regions of immunoglobulin is protein A affinity chromatography, a technique 
that is well known in the art. Staphylococcus protein A is a 42 kD polypeptide that binds 
specifically to a region located between the second and third constant regions of heavy chain 

^ immunoglobulins. Because of the Fc domains of different classes, subclasses and species of 
immunoglobulins, affinity of protein A for human Fc regions is strong, but may vary with 
other species. Subclasses that are less preferred include human IgG-3, and most rat 
subclasses. For certain subclasses, protein G (of Streptococci) may be used in place of 
protein A in the purification. Protein-A sepharose (Pharmacia or Biorad) is a commonly 

^ used solid phase for affinity purification of antibodies, and can be used essentially in the 
same manner for the purification of <x2M polypeptide fused to an immunoglobulin Fc 
fragment. Secreted a2M polypeptide present in cell supernatant binds specifically to protein 
A on the solid phase, while the contaminants are washed away. Bound a2M polypeptide can 
be eluted by various buffer systems known in the art, including a succession of citrate, 

35 acetate and glycine-HCl buffers which gradually lowers the pH. This method is less 

preferred if the recombinant cells also produce antibodies which will be copurified with the 
a2M polypeptide. See, for example, Langone, 1982, J. Immunol, meth. 51:3; Wilchek et al 9 
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1982, Biochem. Intl. 4:629; Sjobring et aL, 1991, J. Biol. Chem. 26:399; page 617-618, in 
Antibodies A Laboratory Manual, edited by Harlow and Lane, Cold Spring Harbor 
laboratory, 1988. 

Alternatively, a polyhistidine tag may be used, in which case, the a2M polypeptide 
can be purified by metal chelate chromatography. The polyhistidine tag, usually a sequence 
of six histidines, has a high affinity for divalent metal ions, such as nickel ions (Ni 2 *), which 
can be immobilized on a solid phase, such as nitrilotriacetic acid matrices. Polyhistidine has 
a well characterized affinity for Ni 2+ -NTA-agarose, and can be eluted with either of two mild 
treatments: imidazole (0.1-0.2 M) will effectively compete with the resin for binding sites; or 
lowering the pH just below 6.0 will protonate the histidine side-chains and disrupt the 
binding. The purification method comprises loading the cell culture supernatant onto the 
Ni 2+ -NTA-agarose column, washing the contaminants through, and eluting the a2M 
polypeptide with imidazole or weak acid. Ni 2+ -NTA-agarose can be obtained from 
commercial suppliers such as Sigma (St. Louis) and Qiagen. Antibodies that recognize the 
polyhistidine tag are also available which can be used to detect and quantify the a2M 
polypeptide. 

Another exemplary affinity label that can be used is the glutathione-S-transferase 
(GST) sequence, originally cloned from the helminth, Schistosoma japonicum. In general, an 
a2M-GST fusion expressed in a prokaryotic host cell, such as E. coli, can be purified from 
the cell culture supernatant by absorption with glutathione agarose beads, followed by 
elution in the presence of free reduced glutathione at neutral pH. Denaturing conditions are 
not required at any stage during purification, and therefore, it may be desirable for use in the 
loading of immobilized a2M polypeptides with antigenic peptides. Moreover, since GST is 
known to form dimers under certain conditions, dimeric a2M polypeptides may be obtained. 
See, Smith, 1993, Methods Mol. Cell Bio. 4:220-229. 

Another useful affinity label that can be used is the maltose binding protein (MBP) of 
E. coli, which is encoded by the malE gene. The secreted a2M polypeptide-MBP present in 
the cell supernatant binds to amylose resin while contaminants are washed away. The bound 
<x2M polypeptide-MBP is eluted from the amylose resin by maltose. See, for example, Guan 
etal, 1987, Gene 67:21-30. 

The second approach for purifying ot2M polypeptide is applicable to affinity labels 
that contain an epitope for which polyclonal or monoclonal antibodies are available. Various 
methods known in the art for purification of protein by immunospecific binding, such as 
immunoaffinity chromatography, and immunoprecipitation, can be used. See, for example, 
Chapter 13 in "Antibodies A Laboratory Manual", 1988, Harlow and Lane, (eds.), Cold 
Spring Harbor Laboratory, N.Y. and Chapter 8, Sections I and n, in "Current Protocols in 
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Immunology", 1991, Coligan et al (eds.), John Wiley, the disclosure of which are both 
incorporated by reference herein. 

The embodiments described above may be used to recover and purify <x2M 
polypeptide-antigenic molecule complexes from the cell culture medium of mammalian 
cells, such as human cells expressing an a2M polypeptide of the invention. The methods can 
be adapted to perform medium and large scale purification of an a2M polypeptide and/or 
<x2M-antigenic molecule complexes. Methods that do not require lowering pH or denaturing 
conditions are most preferred for purification of a2M polypeptide-antigenic molecule 
complexes. The methods may be used to isolate a2M polypeptides from eukaryotic cells, for 
example, cancer cells, tissues, isolated cells, or immortalized eukaryote cell lines infected 
with an intracellular pathogen, or cells obtained from a subject infected with a pathogen. 

5.1.1.4 HOST-VECTOR SYSTEMS 

Described herein are systems of vectors and host cells that can be used for the 

15 expression of a2M polypeptides. A variety of expression vectors may be used in the present 
invention which include, but are not limited to, plasmids, cosmids, phage, phagemids, or 
modified viruses. Typically, such expression vectors comprise a functional origin of 
replication for propagation of the vector in an appropriate host cell, one or more restriction 
endonuclease sites for insertion of the a2M polypeptide gene sequence, and one or more 

20 selection markers. The expression vector must be used with a compatible host cell which 
may be derived from a prokaiyotic or an eukaryotic organism including but not limited to 
bacteria, yeasts, insects, mammals, and humans. 

Expression constructs and vectors are introduced into host cells for the purpose of 
producing an a2M polypeptide. Any cell type that can produce <x2Ms and is compatible with 

25 the expression vector may be used, including those that have been cultured in vitro or 
genetically engineered. Host cells may be obtained from normal or affected subjects, 
including healthy humans, cancer patients, and patients with an infectious disease, private 
laboratory deposits, public culture collections such as the American Type Culture Collection, 
or from commercial suppliers. 

30 Different host cells have characteristic and specific mechanisms for tile post-transla- 

tional processing and modification of proteins. A host cell may be chosen which modifies 
and processes the expressed gene products in a specific fashion similar to the way the 
recipient processes a2Ms. For the purpose of producing large amounts of a2M, it is 
preferable that the type of host cell used in the present invention has been used for expression 

35 of heterologous genes, and is reasonably well characterized and developed for large-scale 
production processes. In a specific embodiment, the host cells are from the same patient to 
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whom a2M polypeptide-antigenic molecule complexes or recombinant cells expressing a2M 
polypeptide-antigenic molecule complexes are going to be administered. Otherwise said, the 
cells used to express the a2M polypeptide and used subsequently to administer 
immunotherapy to a subject are autologous to the subject. 

Preferred mammalian host cells include but are not limited to those derived from 
humans, monkeys and rodents, (see, for example, Kriegler M. in "Gene Transfer and 
Expression: A Laboratory Manual", New York, Freeman & Co. 1990), such as monkey 
kidney cell line transformed by SV40 (COS-7, ATCC CRL 1651); human embryonic kidney 
line (293, 293-EBNA, or 293 cells subcloned for growth in suspension culture, Graham et 
aL, J. Gen. Virol., 36:59, 1977; baby hamster kidney cells (BHK, ATCC CCL 10); Chinese 
hamster oVary-eells-DHFR (CHO, Urlaub and Chasin. Proc. Natl. Acad. Sci. 77; 4216, 
1980); mouse Sertoli cells (Mather, Biol. Reprod. 23:243-251, 1980); mouse fibroblast cells 
(NIH-3T3), monkey kidney cells (CVI ATCC CCL 70); african green monkey kidney cells 
(VERO-76, ATCC CRL-1587); human cervical carcinoma cells (HELA, ATCC CCL 2); 
canine kidney cells (MDCK, ATCC CCL 34); buffalo rat liver cells (BRL 3A, ATCC CRL 
1442); human lung cells (W138, ATCC CCL 75); human liver cells (Hep G2, HB 8065); and 
mouse mammary tumor cells (MMT 060562, ATCC CCL51). Exemplary cancer cell types 
used for demonstrating the utility of recombinant cells (producing a2M polypeptide- 
antigenic molecule complexes) as a cancer vaccine are provided as follows: mouse fibroblast 
cell line, NIH3T3, mouse Lewis lung carcinoma cell line, LLC, mouse mastocytoma cell 
line, P815, mouse lymphoma cell line, EL4 and its ovalbumin transfectant, E.G7, mouse 
melanoma cell line, B16F10, mouse fibrosarcoma cell line, MC57, and human small cell 
lung carcinoma cell lines, SCLC#2 and SCLC#7. 

A number of viral-based expression systems may also be utilized with mammalian 
cells to produce a2M polypeptides. Vectors using DNA virus backbones have been derived 
from simian virus 40 (SV40) (Hamer et aL, 1979, Cell 17:725), adenovirus (Van Doren et 
aL, 1984, Mol Cell Biol 4:1653), adeno-associated virus (McLaughlin et aL, 1988, J Virol 
62:1963), and bovine papillomas virus (Zinn et aL, 1982, Proc Natl Acad Sci 79:4897). In 
cases where an adenovirus is used as an expression vector, the donor DNA sequence may be 
ligated to an adenovirus transcription/translation control region, e.g., the late promoter and 
tripartite leader sequence. This chimeric gene may then be inserted in the adenovirus 
genome by in vitro or in vivo recombination. Insertion in a non-essential region of the viral 
genome (e.g., region El or E3) will result in a recombinant virus that is viable and capable of 
expressing heterologous products in infected hosts (see e.g., Logan and Shenk, 1984, Proc. 
Natl. Acad. Sci. (USA) 81:3655-3659). 

Bovine papillomavirus (BPV) can infect many higher vertebrates, including man, and 
its DNA replicates as an episome. A number of shuttle vectors have been developed for 
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recombinant gene expression which exist as stable, multicopy (20-300 copies/cell) 
extrachromosomal elements in mammalian cells. Typically, these vectors contain a segment 
of BPV DNA (the entire genome or a 69% transforming fragment), a promoter with a broad 
host range, a polyadenylation signal, splice signals, a selectable marker, arid "poisonless" 
plasmid sequences that allow the vector to be propagated in E. coll Following construction 
and amplification in bacteria, the expression gene construct is transfected into cultured 
mammalian cells, for example, by the techniques of calcium phosphate coprecipitation or 
electroporation. For those host cells that do not manifest a transformed phenotype, selection 
of transformants is achieved by use of a dominant selectable marker, such as histidinol and 
G418 resistance. For example, BPV vectors such as pBCMGSNeo and pBCMGHis may be 
used to express a2M polypeptide sequences (Karasuyama al, Eur. J. Immunol. 18:97-104; 
Ohe et al, Human Gene Therapy, 6:325-33) which may then be transfected into a diverse 
range of cell types for expression of the a2M polypeptide. 

Alternatively, the vaccinia 7.5K promoter may be used (see, e.g., Mackett et al, 
1982, Proc. NatL Acad. Sci. (USA) 79:7415-7419; Mackett et al, 1984, J. Virol. 49:857-864; 
Panicali et al, 1982, Proc. Natl. Acad. Sci. 79:4927-493 1.) In cases where a human host cell 
is used, vectors based on the Epstein-Barr virus (EBV) origin (OriP) and EBV nuclear 
antigen 1 (EBNA-1; a trans-acting replication factor) may be used. Such vectors can be used 
with a broad range of human host cells, e.g., EBO-pCD (Spickofsky et al, 1990, DNA Prot 
Eng Tech 2:14-18), pDR2 and XDR2 (available from Clontech Laboratories). 

a2M polypeptides may also be made with a retrovirus-based expression system. In 
contrast to transfection, retroviruses can efficiently infect and transfer genes to a wide range 
of cell types including, for example, primary hematopoietic cells. In retroviruses such as 
Moloney murine leukemia virus, most of the viral gene sequences can be removed and 
replaced with nucleic acid sequences encoding a2M, while the missing viral functions can be 
supplied in trans. The host range for infection by a retroviral vector can also be manipulated 
by the choice of envelope used for vector packaging. 

For example, a retroviral vector can comprise a 5* long terminal repeat (LTR), a 3' 
LTR, a packaging signal, a bacterial origin of replication, and a selectable marker. The a2M 
polypeptide DNA is inserted into a position between the 5' LTR and 3* LTR, such that 
transcription from the 5* LTR promoter transcribes the cloned DNA. The 5' LTR comprises 
a promoter, including but not limited to an LTR promoter, an R region, a U5 region and a 
primer binding site, in that order. Nucleotide sequences of these LTR elements are well 
known in the art. A heterologous promoter as well as multiple drug selection markers may 
also be included in the expression vector to facilitate selection of infected cells (see 
McLauchlin et al, 1990, Prog. Nucleic Acid Res. and Molec. Biol. 38:91-135; Morgenstern 
etal, 1990, Nucleic Acid Res. 18:3587-3596; Choulikae* al, 1996, J. Virol 70:1792-1798; 
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Boesen et al, 1994, Biotherapy 6:291-302; Salmons and Gunzberg, 1993, Human Gene 
Therapy 4:129-141; and Grossman and Wilson, 1993, Curr. Opin. in Genetics and Devel. 
3:110-114). 

Other useful eukaryotic host-vector system may include yeast and insect systems. In 
yeast, a number of vectors containing constitutive or inducible promoters may be used with 
Saccharomyces cerevisiae (baker's yeast), Schizosaccharomyces pombe (fission yeast), 
Pichia pastoris, and Hansenula polymorpha (methylotropic yeasts). For a review see, 
"Current Protocols in Molecular Biology" Vol. 2, 1988, Ausubel et al (eds.), Greene 
Publish. Assoc. & Wiley Interscience, Ch. 13; Grant et al, 1987, Expression and Secretion 
Vectors for Yeast, 1987, in "Methods in Enzymology", Wu and Grossman (eds.), Acad. 
Press, N.Y., Vol. 153, pp. 516-544; Glover, 1986, DNA Cloning, Vol. H, IRL Press, Wash., 
D.C., Ch. 3; and Bitter, 1987, Heterologous Gene Expression in Yeast, in "Methods in 
Enzymology", Berger and Kimmel (eds.), Acad. Press, N.Y., Vol. 152, pp. 673-684; and 
"The Molecular Biology of the Yeast Saccharomyces", 1982, Strathern et al (eds.), Cold 
Spring Harbor Press, Vols. I and H 

In an insect system a baculovirus, Autographa californica nuclear polyhidrosis virus 
(AcNPV), can be used as a vector to express an a2M polypeptide in Spodoptera fmgiperda 
cells. The a2M polypeptide DNA may be cloned into non-essential regions (for example the 
polyhedrin gene) of the virus and placed under control of an AcNPV promoter (for example 
the polyhedrin promoter). These recombinant viruses are then used to infect host cells in 
which the inserted DNA is expressed (see, e.g., Smith et al, 1983, J. Virol. 46:584; Smith, 
U.S. Patent No. 4,215,051). 

Any of the cloning and expression vectors described herein may be synthesized and 
assembled from known DNA sequences by techniques well known in the art. The 
regulatory regions and enhancer elements can be of a variety of origins, both natural and 
synthetic. Some vectors and host cells may be obtained commercially. Non-limiting 
examples of useful vectors are described in Appendix 5 of Current Protocols in Molecular 
Biology, 1988, ed. Ausubel et al, Greene Publish. Assoc. & Wiley Interscience, which is 
incorporated herein by reference; and the catalogs of commercial suppliers such as Clontech 
Laboratories, Stratagene Inc., and Invitrogen, Inc. 

5.1.1.5 SYNTHETIC PRODUCTION 
An alternative to producing <x2M by recombinant techniques is peptide synthesis. For 
example, a peptide corresponding to a portion of an a2M comprising the substrate-binding 
domain, or which binds peptides in vitro, can be synthesized by use of a peptide synthesizer. 
Conventional peptide synthesis may be used or other synthetic protocols well known in the 
art. 
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In addition, analogs and derivatives of ct2M polypeptides can be chemically 
synthesized. Furthermore, if desired, nonclassical amino acids or chemical amino acid 
analogs can be introduced as a substitution or addition into the a2M sequence. Non-classical 
amino acids include but are not limited to the D-isomers of the common amino acids, a- 
amino isobutyric acid, 4-aminobutyric acid, Abu, 2-amino butyric acid, y-Abu, e-Ahx, 
6-amino hexanoic acid, Aib, 2-amino isobutyric acid, 3-amino propionic acid, ornithine, 
norleucine, norvaline, hydroxyproline, sarcosine, citrulline, cysteic acid, t-butylglycine, t- 
butylalanine, phenylglycine, cyclohexylalanine, P-alanine, fluoro-amino acids, designer 
amino acids such as P-methyl amino acids, Ca-methyl amino acids, Na-methyl amino acids, 
and amino acid analogs in general. 

Peptides having a2M amino acid sequences, or a fragment, analog, mutant or 
derivative thereof, may be synthesized by solid-phase peptide synthesis using procedures 
similar to those described by Merrifield, 1963, J. Am. Chem. Soc, 85:2149. During 
synthesis, N-a-protected amino acids having protected side chains are added stepwise to a 
growing polypeptide chain linked by its C-terminal and to an insoluble polymeric support 
i.e., polystyrene beads. The peptides are synthesized by linking an amino group of an N-a- 
deprotected amino acid to an a-carboxyl group of an N-a-protected amino acid that has been 
activated by reacting it with a reagent such as dicyclohexylcarbodiimide. The attachment of 
a free amino group to the activated carboxyl leads to peptide bond formation. The most 
commonly used N-a-protecting groups include Boc which is acid labile and Fmoc which is 
base labile. Details of appropriate chemistries, resins, protecting groups, protected amino 
acids and reagents are well known in the art and so are not discussed in detail herein (See 9 
Atherton, et ai, 1989, Solid Phase Peptide Synthesis: A Practical Approach, IRL Press, and 
Bodanszky, 1993, Peptide Chemistry, A Practical Textbook, 2nd Ed., Springer-Verlag). 

Purification of the resulting a2M polypeptides accomplished using conventional 
procedures, such as preparative HPLC using gel permeation, partition and/or ion exchange 
chromatography. The choice of appropriate matrices and buffers are well known in the art 
and so are not described in detail herein. 



3Q 5.2 ANTIGENIC COMPLEXES COMPRISING a2M POLYPEPTIDES 

5.2.1 ISOLATION OF INTRACELLULAR COMPLEXES OF a2M 
POLYPEPTIDES WITH ANTIGENIC MOLECULES 

Described herein are methods for purifying a2M polypeptides or o2M 
polypeptide-antigenic molecule complexes of the invention from recombinant cells, and, 
35 with minor modifications known in the art, the a2M polypeptide or a2M-antigenic molecule 
complexes from the cell culture. Recombinant cells include, for example, cells expressing 
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antigenic molecules and recombinantly expressing an <x2M polypeptide. Such cells may be 
derived from a variety of sources, including, but not limited to, cells infected with an 
infectious agent and cancer cells. 

The invention provides methods for purification of recombinant <x2M 
polypeptide-antigenic molecule complexes by affinity purification, based on the properties 
of the affinity label present on the a2M polypeptide. One approach is based on specific 
molecular interactions between a tag and its binding partner. The other approach relies on 
the immunospecific binding of an antibody to an epitope present on the tag. The principle of 
affinity chromatography well known in the art is generally applicable to both of these 
approaches. 

To produce a2M polypeptide-antigenic molecule complexes, a nucleotide sequence 
encoding an a2M polypeptide can be introduced into a cell. When an antigenic molecule is 
present in the cell, the <x2M polypeptide can associate intracellularly with the antigenic 
molecule, forming a covalent or a noncovalent complex of a2M polypeptide and the 
antigenic molecule. Cells into which an a2M polypeptide-encoding nucleotide sequence can 
be introduced, include, but are not limited to, epithelial cells, endothelial cells, keratinocytes, 
fibroblasts, muscle cells, hepatocytes; blood cells such as T lymphocytes, B lymphocytes, 
monocytes, macrophages, neutrophils, eosinophils, megakaryocytes, granulocytes; various 
stem or progenitor cells, in particular hematopoietic stem or progenitor cells, e.g., as obtained 
from bone marrow, umbilical cord blood, peripheral blood, fetal liver, etc. The choice of cell 
type depends on the type of tumor or infectious disease being treated or prevented, and can 
be determined by one of skill in the art. In a specific embodiment, an expression construct 
comprising a nucleic acid sequence encoding the <x2M polypeptide is introduced into an 
antigenic cell. As used herein, antigenic cells may include cells that are infected with an 
infectious agent or pathogen, cells infected with non-infectious or non-pathogenic forms of 
an infectious agent or pathogen (e.g., by use of a helper infectious agent), cells infected by or 
engineered to express an attenuated form of an infectious agent or a non-pathogenic or 
replication-deficient variant of a pathogen, pre-neoplastic cells that are infected with a 
cancer-causing infectious agent, such as a virus, but which are not yet neoplastic; or 
antigenic cells that have been exposed to a mutagen or cancer-causing agent, such as, for 
example DNA-damaging agents, radiation, etc. Other cells that can be used are pre- 
neoplastic cells which are in transition from a normal to a neoplastic form as characterized by 
morphology, physiological or biochemical functions. Preferably, the cancer cells and pre- 
neoplastic cells used in the methods of the invention are of mammalian origin. Mammals 
contemplated by this aspect of the invention include humans, companion animals (e.g, dogs 
and cats), livestock animals (e.g., sheep, cattle, goats, pigs and horses), laboratory animals 
(e.g., mice, rats and rabbits), and captive or free wild animals. 
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In various embodiments, any cancer cell, preferably a human cancer cell, can be used 
in the present methods for producing a2M polypeptide-antigenic molecule complexes- The 
cancer cells provide the antigenic peptides which become associated covalently or 
noncovalently with the expressed <x2M polypeptide. a2M polypeptide-antigenic molecule 
complexes are then purified from the cells and used to treat such cancers. Cancers which can 
be treated or prevented with immunogenic compositions prepared by methods of the 
invention include, but are not limited to, tumors such as sarcomas and carcinomas. 
Examples of cancers that are amenable to the methods of the invention are listed in Section 
5.6. Accordingly, any tissues or cells isolated from a pre-neoplastic lesion, a cancer, 
including cancer that has metastasized to multiple remote sites, can be used in the present 
method. For example, cells found in abnormally growing tissue, circulating leukemic cells, 
metastatic lesions as well as solid tumor tissue can be used. 

In another embodiment, cell lines derived from a pre-neoplastic lesion, cancer tissues 
or cancer cells can also be used, provided that the cells of the cell line have at least one or 
more antigenic determinants in common with antigens on the target cancer cells. Cancer 
tissues, cancer cells, cells infected with a cancer-causing agent, other pre-neoplastic cells, 
and cell lines of human origin are preferred. 

Cancer and pre-neoplastic cells can be identified by any method known in the art. 
For example, cancer cells can be identified by morphology, enzyme assays, proliferation 
assays, cytogenetic characterization, DNA mapping, DNA sequencing, the presence of 
cancer-causing virus, or a history of exposure to mutagen or cancer-causing agent, imaging, 
etc. Cancer cells may also be obtained by surgery, endoscopy, or other biopsy techniques. 
If some distinctive characteristics of the cancer cells are known, they can also be obtained or 
purified by any biochemical or immunological methods known in the art, such as but not 
limited to affinity chromatography, and fluorescence activated cell sorting with 
fluorescently tagged antibody against an antigen expressed by the cancer cells). 

Cancer tissues, cancer cells or cell lines may be obtained from a single individual or 
pooled from several individuals. It is not essential that clonal, homogeneous, or purified 
population of cancer cells be used. It is also not necessary to use cells of the ultimate target 
in vivo (e.g., cells from the tumor of the intended recipient), so long as at least one or more 
antigenic determinants on the target cancer cells is present on the cells used for expression of 
the a2M polypeptide. In addition, cells derived from distant metastases may be used to 
prepare an immunogenic composition against the primary cancer. A mixture of cells can be 
used provided that a substantial number of cells in the mixture are cancer cells and share at 
least one antigenic determinant with the target cancer cell. In a specific embodiment, the 
cancer cells to be used in expressing an a2M polypeptide are purified. 
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5.2.2 IN VITRO COMPLEXING 

In another embodiment, complexes of o2M polypeptides and antigenic molecules are 
produced in vitro. Immunogenic a2M polypeptide - antigenic molecule complexes can be 
generated in vitro by any method known in the art for forming a2M polypeptide - antigenic 
molecule complexes. Procedures for forming such a2M-antigenic molecule complexes and 
methods for isolating antigenic peptides are described in detail herein. 

Methods for formation in vitro of noncovalent immunogenic complexes are well 
known in the art. For example, such complexes can be generated in vitro by noncovalent 
complexing of an a2M polypeptide with an antigenic molecule using methods which have 
been previously described for noncovalent coupling of an HSP with an antigenic molecule 
(see e.g., Blachere et al, 1997, supra; PCT publication WO 97/10000, dated March 20, 
1997). Preferably, the immunogenic molecular complex is not prepared by treatment with a 
protease, or with an activating agent such as ammonia or methyamine. In another preferred 
embodiment, the ct2M molecule of the immunogenic molecular complex is not cleaved 
within the "bait" region. In yet another embodiment, the a2M polypeptide is not covalently 
associated with the antigenic molecule through a thioester linkage. 

Methods for covalent coupling are also well known in the art (see, e.g., Osada et al 9 
1987, supra; Osada et al, 1988, supra; Chu and Pizzo 1993, supra; Chu et al, 1994, supra; 
Mitsuda et al. , 1 993 , supra). In one embodiment, for example, when an a2M polypeptide is 
mixed with protease,, During proteolytic activation of a2M, non-proteolytic ligands can 
become covalently bound to the activated thioesters. Non-proteolytic ligands can also be 
incorporated into the activated a2M molecule by ammonia or methylamine during reversal of 
the nucleophilic activation, employing heat (Gran and Pizzo, 1998, Biochemistry, 37: 6009- 
6014). Such conditions that allow fortuitous trapping ofpeptides by a2M are employed to 
prepare the a2M polypeptide - antigenic molecule complexes of the invention. 

For example, in various embodiments of the invention, an a2M polypeptide may be 
mixed with antigenic molecule in the presence of a protease, ammonia or other small amine 
nucleophiles such as methylamine and ethylamine. Non-limiting examples of proteases 
which may be used include trypsin, porcine pancreatic elastase (PEP), human neutrophil 
elastase, cathepsin G, S. aureus V-8 proteinase trypsin, a-chymotrypsin, V8 protease, papain, 
and proteinase K (see Ausubel et al, (eds.), in "Current Protocols in Molecular Biology", 
Greene Publishing Associates and Wiley Interscience, New York, 17.4.6-17.4.8). 

In another embodiment for preparation of covalent a2M polypeptide-antigenic 
molecule complexes, a2M polypeptides and antigenic molecules are prepared, and then 
covalently coupled using, for example, chemical crosslinking. Chemical crosslinking 
methods are well known in the art. For example, in a preferred embodiment, glutaraldehyde 
crosslinking may be used. Glutaradehyde crosslinking has been used for formation of 
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covalent complexes of peptides and hsps (see Barrios et aL> 1992, Eur. J. Immunol. 22: 
1365-1372). In one embodiment, the following protocol is used. Optionally, <x2M 
polypeptides may be pretreated with ATP or low pH prior to complexing, in order to remove 
any peptides that may be associated with the <x2M polypeptide. Preferably, 1 mg of a2M 
polypeptide is crosslinked to 1 mg of peptide in the presence of 0.002% glutaraldehyde for 2 
hours. Glutaraldehyde is removed by dialysis against phosphate buffered saline (PBS) 
overnight (Lussow et al 9 1991, Eur. J. Immunol. 21: 2297-2302). 

Other methods for chemical crosslinking may also be used, in addition other methods 
for covalent attachment of proteins, such as photocrosslinking (see Current Protocols in 
Molecular Biology, Ausubel et al (eds.), Greene Publishing Associates and Wiley 
Interscience, New York). 

Antigenic molecules for covalent or noncovalent a2M polypeptide-antigenic 
molecule complexes may be isolated from various sources, chemically synthesized, or 
produced recombinantly. Such methods can be readily adapted for medium or large scale 
production of the immunotherapeutic or prophylactic vaccines of the invention. 

Following complexing, the immunogenic a2M-antigenic molecule complexes can 
optionally be purified. In a preferred embodiment, such complexes are at least 65%, 70%, 
75%, 80%, 85%, 90%, 95%, 98%, or 100% nonconvalent complexes of a2M and the 
antigenic molecule. Such complexes may be assayed in vitro using, for example, the mixed 
lymphocyte target cell assay (MLTC) described below. Once immunogenic complexes have 
been isolated they can be optionally characterized further in animal models using the 
preferred administration protocols and excipients discussed below. 



5.2.3. a2M - ANTIGENIC MOLECULE FUSION PROTEINS 

In another embodiment, recombinant fusion proteins, comprised of a2M 
sequences linked to antigenic molecule sequences, may be used for immunotherapy. To 
produce such a recombinant fusion protein, an expression vector is constructed using nucleic 
acid sequences encoding a2M fused to sequences encoding an antigenic molecule, using 
recombinant methods known in the art, such as those described in Sections 5.1.1.1 and 
5.1.1.2, above (see Suzue et al, 1997, Proc. Natl. Acad. Sci. U.S.A. 94: 13146-51). a2M- 
antigenic peptide fusions are then expressed and isolated. By specifically designing the 
antigenic peptide portion of the molecule, such fusion proteins can be used to elicit an 
immune response and in immunotherapy against target cancer and infectious diseases. 
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5.2.4 SOURCES OF ANTIGENIC MOLECULES 

Antigenic molecules, or antigenic portions thereof, specific to one or more types of 
cancer or infected cells, can be chosen from among those known in the art. Alternatively 
such antigenic molecules can be selected for their antigenicity or their immunogenicity, as 
determined by immunoassays or by their ability to generate an immune response. 

5.2.4.1 EXOGENOUS ANTIGENIC MOLECULES 
Preferably, where it is desired to treat or prevent cancer, known tumor-specific 
antigenic molecules or fragments or derivatives thereof are used. For example, such tumor 
specific or tumor-associated antigenic molecules include but are not limited to KS 1/4 pan- 
carcinoma antigen (Perez and Walker, 1990, J. Immunol. 142:3662-3667; Bumal, 1988, 
Hybridoma 7(4):407-415); ovarian carcinoma antigen (CA125) (Yu, et aL> 1991, Cancer 
Res. 51(2):468-475); prostatic acid phosphate (Tailer, et al 9 1990, NucL Acids Res. 
18(16):4928); prostate specific antigen (Henttu and Vihko, 1989, Biochem. Biophys. Res. 
Comm. 160(2):903-910; Israeli, etal, 1993, Cancer Res. 53:227-230); melanoma-associated 
antigen p97 (Estin, et al. 9 1989, 1 Natl. Cancer Inst. 81(6):445-446); melanoma antigen gp75 
(Vijayasardahl, et al, 1990, J. Exp. Med 171(4);1375-1380); high molecular weight 
melanoma antigen (Natali, etal, 1987, Cancer 59:55-63) and prostate specific membrane 
antigen. 

In a specific embodiment, an antigenic molecule or fragment or derivative thereof 
specific to a certain tumor is selected for complexing to a2M polypeptide and subsequent 
administration to a patient having that tumor. 

In a preferred embodiment, where it is desired to treat or prevent viral diseases, 
molecules comprising epitopes of known viruses are used. For example, such antigenic 
epitopes may be prepared from viruses including, but not limited to, hepatitis type A, 
hepatitis type B, hepatitis type C, influenza, varicella, adenovirus, herpes simplex type I 
(HSV-I), herpes simplex type H (HSV-II), rinderpest, rhinovirus, echovirus, rotavirus, 
respiratory syncytial virus, papilloma virus, papova virus, cytomegalovirus, echinovirus, 
arbovirus, huntavirus, coxsackie virus, mumps virus, measles virus, rubella virus, polio virus, 
human immunodeficiency virus type I (HTV-I), and human immunodeficiency virus type II 
(HIV-II). 

In another preferred embodiment, where it is desired to treat or prevent bacterial 
infections, molecules comprising epitopes of known bacteria are used. For example, such 
antigenic epitopes may be prepared from bacteria including, but not limited to, mycobacteria 
rickettsia, mycoplasma, neisseria and legionella. 

In another preferred embodiment, where it is desired to treat or prevent protozoal 
infections, molecules comprising epitopes of known protozoa are used. For example, such 
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antigenic epitopes may be prepared from protozoa including, but not limited to, leishmania, 
kokzidioa, and trypanosoma. 

In yet another preferred embodiment, where it is desired to treat or prevent parasitic 
infections, molecules comprising epitopes of known parasites are used. For example, such 
antigenic epitopes may be from parasites including, but not limited to, chlamydia and 
rickettsia. 

To determine immunogenicity or antigenicity of a putative antigen by detecting 
binding to antibody, various immunoassays known in the art can be used, including but not 
limited to competitive and non-competitive assay systems using techniques such as 
radioimmunoassays, ELIS A (enzyme linked immunosorbent assay), "sandwich" 
immunoassays, immunoradiometric assays, gel diffusion precipitin reactions, 
immunodiffusion assays, in vivo immunoassays (using colloidal gold, enzyme or 
radioisotope labels, for example), western blots, immunoprecipitation reactions, 
agglutination assays (e.g., gel agglutination assays, hemagglutination assays), complement 
fixatioh assays, immunofluorescence assays, protein A assays, and Immunoelectrophoresis 
assays, etc. In one aspect, antibody binding is detected by detecting a label on the primary 
antibody. In another aspect, the primary antibody is detected by detecting binding of a 
secondary antibody or reagent to the primary antibody. In a further aspect, the secondary 
antibody is labeled. Many means are known in the art for detecting binding in an 
immunoassay and are envisioned for use. In one embodiment for detecting immunogenicity, 
T cell-mediated responses can be assayed by standard methods, e.g., in vitro cytoxicity 
assays or in vivo delayed-type hypersensitivity assays. 

Potentially useful antigenic molecules, or derivatives thereof, can be identified by 
various criteria, such as the antigen's involvement in neutralization of a pathogen's infectivity 
(wherein it is desired to treat or prevent infection by such a pathogen) (Norrby, 1985, 
Summary, in Vaccines 85, Lerner, et al (eds.), Cold Spring Harbor Laboratory, Cold Spring 
Harbor, New York, pp. 388-389), type or group specificity, recognition by patients 1 antisera 
or immune cells, and/or the demonstration of protective effects of antisera or immune cells 
specific for the antigen. In addition, where it is desired to treat or prevent a disease caused 
by pathogen, the antigen's encoded epitope should preferably display a small or no degree of 
antigenic variation in time or amongst different isolates of the same pathogen. 



5.2.4.2 ANTIGENIC MOLECULES FROM a2M COMPLEXES 

Antigenic peptides for complexing in vitro to a2M polypeptides of the invention can 
35 also be obtained from endogenous complexes of peptides and <x2Ms. Two methods may be 
used to elute the peptide from an a2M-antigenic molecule complex. One approach involves 
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incubating the a2M-antigenic molecule complex in the presence of ATP. The other approach 
involves incubating the complexes in a low pH buffer. 

Briefly, the complex of interest is centrifuged through a Centricon 10 assembly 
(Millipore) to remove any low molecular weight material loosely associated with the 
complex. The large molecular weight fraction may be removed and analyzed by SDS-PAGE 
while the low molecular weight may be analyzed by HPLC as described below. In the ATP 
incubation protocol, the a2M-antigenic molecule complex in the large molecular weight 
fraction is incubated with lOmM ATP for 30 minutes at room temperature. In the low pH 
protocol, acetic acid or trifluoroacetic acid (TFA) is added to the <x2M-antigenic molecule 
complex to give a final concentration of 10% (vol/vol) and the mixture incubated at room 
temperature or in a boiling water bath or any temperature in between, for 10 minutes. 

The resulting samples are centrifuged through a CentriconlO assembly as mentioned 
previously. The high and low molecular weight fractions are recovered. The remaining 
large molecular weight a2M-antigenic molecule complexes can be reincubated with ATP or 
low pH to remove any remaining peptides. 

The resulting lower molecular weight fractions are pooled, concentrated by 
evaporation and dissolved in 0.1% TFA. The dissolved material is then fractionated by 
reverse phase high pressure liquid chromatography (HPLC) using for example a VYDAC 
CI 8 reverse phase column equilibrated with 0. 1 % TFA. The bound material is then eluted at 
a flow rate of about 0.8 ml/min by developing the column with a linear gradient of 0 to 80% 
acetonitrile in 0.1% TFA. The elution of the peptides can be monitored by OD 210 and the 
fractions containing the peptides collected. 



5.2.4.3 PEPTIDE ANTIGENS FROM MHC COMPLEXES 
Peptides bound to MHC molecules in vivo can also be used in vitro to form 
complexes with a2M polypeptides of the invention. The isolation of potentially 
immunogenic peptides from MHC molecules is well known in the art and so is not described 
in detail herein (see, Falk, et al, 1990, Nature 348:248-251; Rotzsche, at al, 1990, Nature 
348:252-254; Elliott, et al, 1990, Nature 348:191-197; Falk, et al, 1991, Nature 351:290- 
296; Demotz, et al, 1989, Nature 343:682-684; Rotzsche, et al, 1990, Science 249:283- 
287), the disclosures of which are incorporated herein by reference. 

Briefly, MHC-antigenic molecule complexes may be isolated by a conventional 
immunoaffinity procedure. The peptides then may be eluted from the MHC-antigenic 
molecule complex by incubating the complexes in the presence of about 0.1% TFA in 
acetonitrile. The eluted peptides may be fractionated and purified by reverse phase HPLC, as 
before. 
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5.2.4.4 SYNTHETIC ANTIGENIC MOLECULES 

The amino acid sequences of the peptides eluted from MHC molecules or a2M may 
be determined either by manual or automated amino acid sequencing techniques well known 
in the art. Once the amino acid sequence of a potentially protective peptide has been 
determined, the peptide may be synthesized in using conventional peptide synthesis or other 
protocols well known in the art. 

Peptides having the same amino acid sequence as those isolated above may be 
synthesized by solid-phase peptide synthesis using procedures similar to those described by 
Merrifield, 1963, X Am. Chem. Soc., 85:2149. During synthesis, N-a-protected amino acids 
having protected side chains are added stepwise to a growing polypeptide chain linked by its 
C-terminal and to an insoluble polymeric support i.e., polystyrene beads. The peptides are 
synthesized by linking an amino group of an N-a-deprotected amino acid to an a-carboxyl 
group of an N-a-protected amino acid that has been activated by reacting it with a reagent 
such as dicyclohexylcarbodiimide. The attachment of a free amino group to the activated 
carboxyl leads to peptide bond formation. The most commonly used N-a-protecting groups 
include Boc which is acid labile and Fmoc which is base labile. Details of appropriate 
chemistries, resins, protecting groups, protected amino acids and reagents are well known in 
the art and so are not discussed in detail herein (See, Atherton, et al 9 1989, Solid Phase 
Peptide Synthesis: A Practical Approach, IRL Press, and Bodanszky, 1993, Peptide 
Chemistry, A Practical Textbook, 2nd Ed., Springer-Verlag). 

Purification of the resulting peptides is accomplished using conventional procedures, 
such as preparative HPLC using gel permeation, partition and/or ion exchange 
chromatography. The choice of appropriate matrices and buffers are well known in the art 
and so are not described in detail herein. 

5.2.4.5 KECOMBINANTLY PRODUCED ANTIGENIC MOLECULES 

In a particular embodiment of the invention, a nucleotide sequence encoding a protein 
antigenic molecule or portions thereof can be introduced into a host cell for production of the 
antigenic molecule. The nucleotide sequence encoding any antigenic protein can be obtained 
and cloned into an expression vector for expression essentially by the same methods 
described for the cloning and expression of a nucleotide sequence encoding an a2M 
polypeptide. The techniques are described in Sections 5.1.1.1 and 5.1.1.2, and are well 
known in the art. The recombinant antigenic protein or portions thereof can be purified by 
any methods appropriate for the protein, and then used to form complexes with a2M 
polypeptides in vitro as described in Section 5.2.2. Such an <x2M polypeptide-antigenic 
molecule complex can be used as a vaccine to stimulate an immune response against the 
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antigenic protein in a subject for the purpose of treatment or prevention of infectious diseases 
or cancer. 

5.3 THERAPEUTIC APPLICATIONS FOR a2M COMPLEXES 

5 The present invention encompasses the use of a2M polypeptides in methods for 

treatment of and prevention of infectious diseases and cancer. In various embodiments 
described in detail herein, an effective amount of a a2M polypeptide in a covalent or 
nonco valent complex with an antigenic molecule is administered to a patient for therapeutic 
purposes. 

10 

5.3.1 PREVENTION AND TREATMENT OF INFECTIOUS DISEASES 
For treatment and prevention of infectious disease, <x2M - antigenic molecule 
complexes are prepared from a cell that displays the antigenicity of an antigen of an 
infectious agent or pathogenic agent, and used as vaccines against the infectious disease. As 

15 will be appreciated by those skilled in the art, the protocols described herein may be used to 
isolate a2M polypeptide-antigenic molecule complexes from any cell that displays the 
antigenicity of an antigen of the infectious agent. For example, cells may be infected by the 
infectious agent itself, or alternatively, cells may be infected by or engineered to express an 
attenuated form of the infectious agent or a non-pathogenic or replication-deficient variant of 

20 the pathogen. In one embodiment, a2M- antigenic molecule complexes can be prepared 
from cells infected with non-infectious or non-pathogenic forms of the infectious agent (e.g., 
by use of a helper infectious agent). In another embodiment, the a2M-antigenic molecule 
complexes of the invention may be prepared from cells infected with an intracellular 
pathogen. In another embodiment, a2M polypeptide-complexes can be prepared from cells 

25 that have been transformed by an intracellular pathogen. For example, immunogenic a2M 
polypeptide-antigenic molecule complexes may be isolated from eukaryotic cells 
transformed with a transforming virus such as S V40. 

A preferred method for treatment or prevention of an infectious disease comprises 
introducing into a cell that displays the antigenicity of an infectious agent an expressible 

30 a2M polypeptide gene sequence, preferably as an expression gene construct. The a2M 
polypeptide gene sequence is manipulated by recombinant methods, such as those described 
above in Sections 5.1.1.1 and 5.1.1.2 above, so that the <x2M polypeptide gene sequence, in 
the form of an expression construct, located extrachromosomally or integrated in the 
chromosome, is suitable for expression of the <x2M polypeptide in the recombinant cells. 

35 The recombinant cells containing the expression gene constructs are cultured under 
conditions such that <x2M polypeptides encoded by the expression gene construct are 
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expressed Complexes of a2M polypeptides covalently or noncovalently associated with 
antigenic molecules of the infectious agent are purified from the cell culture or culture 
medium by the methods described in Section 5.2.1. 

In various embodiments, a2M - antigenic molecule complexes are prepared from a 
cell genetically manipulated to express an a2M polypeptide, for example, tissues, isolated 
cells or immortalized eukaryotic cell lines infected with an intracellular pathogen. When 
immortalized animal cell lines are used as a source of the a2M polypeptide-antigenic 
molecule complex, it is important to use cell lines that can be infected with the pathogen of 
interest. In addition, it is preferable to use cells that are derived from the same species as the 
intended recipient of the vaccine. Techniques for introducing an expressible form of the 
a2M polypeptide gene sequences into these cell lines are described above in Section 5.1.1.2. 
If a pathogen is expected to cause lysis of the host cells, it is preferred to introduce the 
expressible a2M polypeptide gene sequence into the host cell prior to infecting the cells with 
the pathogen. For example, in order to prepare an <x2M polypeptide-antigenic molecule 
complex for administration to humans that may be effective against HIV-1, the virus may be 
propagated in human cells which include, but are not limited to, human CD4+ T cells, 
HepG2 cells, and U937 promonocyte cells, which have already been transfected with an 
expressible a2M polypeptide sequence. Similarly, influenza viruses may be propagated in, 
for example, transfected human fibroblast cell lines and MDCK cells, and mycobacteria may 
be cultured in, for example, transfected human Schwaan cells. The cell supernatant 
containing <x2M-antigenic molecule complex may be collected just prior to lysis of the host 
cell. 

In a preferred aspect of the invention, the purified a2M - antigenic molecule complex 
vaccines may have particular utility in the treatment of human diseases caused by 
intracellular pathogens. It is appreciated, however, that the vaccines developed using the 
principles described herein will be useful in treating diseases of other mammals, for example, 
farm animals including: cattle; horses; sheep; goats; and pigs, and household pets including: 
cats; and dogs, that similarly are caused by intracellular pathogens. 

In accordance with the methods described herein, vaccines may be prepared that 
stimulate an immune response, in particular a cytotoxic T cell responses, against cells 
infected with viruses including, but not limited to, hepatitis type A, hepatitis type B, hepatitis 
type C, influenza, varicella, adenovirus, HSV-I, HSV-H, rinderpest rhinovirus, echovirus, 
rotavirus, respiratory syncytial virus, papilloma virus, papova virus, cytomegalovirus, 
echinoviru$, arbovirus, huntavirus, coxsachie virus, mumps virus, measles virus, rubella 
virus, polio virus, HTV-I, and HTV-II. Similarly, vaccines may also be prepared that 
stimulate cytotoxic T cell responses against cells infected with intracellular bacteria, 
including, but not limited to, Mycobacteria, Rickettsia, Mycoplasma, Neisseria and 
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Legionella. In addition, vaccines may also be prepared that stimulate cytotoxic T cell 
responses against cells infected with intracellular protozoa, including, but not limited to, 
Leishmani, Kokzidioa, and Trypanosoma. Furthermore, vaccines may be prepared that 
stimulate cytotoxic T cell responses against cells infected with intracellular parasites 
including, but not limited to, Chlamydia and Rickettsia. 

The effect of immunotherapy with modified a2M polypeptide-antigenic molecule 
complexes on progression of infectious diseases can be monitored by any methods known to 
one skilled in the art. 
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5.3.2 PREVENTION AND TREATMENT OF CANCER 

There are many reasons why immunotherapy as provided by the covalent or 
noncovalent a2M polypeptide-antigenic molecule complexes or recombinant cells expressing 
a2M polypeptides prepared by the present invention is desired for use in cancer patients. 
First, if cancer patients are immunosuppressed, and surgery with anesthesia, and subsequent 
chemotherapy, may worsen the immunosuppression, then with appropriate immunotherapy 
in the preoperative period, this immunosuppression may be prevented or reversed. This 
could lead to fewer infectious complications and to accelerated wound healing. Second, 
tumor bulk is minimal following surgery and immunotherapy is most likely to be effective in 
this situation. A third reason is the possibility that tumor cells are shed into the circulation at 
surgery and effective immunotherapy applied at this time can eliminate these cells. 

In a specific embodiment, the preventive and therapeutic utility of the invention is 
directed at enhancing the immunocompetence of the cancer patient either before surgery, at 
or after surgery, and at inducing tumor-specific immunity to cancer cells, with the objective 
being inhibition of cancer, and with the ultimate clinical objective being total cancer 
regression and eradication. 

According to the invention, preferred methods of treatment or prevention of cancer 
comprise isolating cancer cells from one or more individual, preferably the individual in need 
of treatment, and introducing into such cells an expressible a2M polypeptide gene sequence, 
preferably as an expression gene construct. The a2M polypeptide gene sequence is 
manipulated by methods described above in Sections 5.1.1.1 and 5. 1 . 1 .2, such that the <x2M 
polypeptide gene sequence, in the form of an expression construct, or intrachromosomally 
integrated, are suitable for expression of the <x2M polypeptide in the recombinant cells. The 
recombinant cells containing the expression gene constructs are cultured under conditions 
such that a2M polypeptides encoded by the expression gene construct are expressed by the 
recombinant host cells. Complexes of a2M polypeptides covalently or noncovalently 
associated with antigenic molecules of the cancer cell are purified from the cell culture or 
culture medium by the methods described in Section 5.2. 1. Depending on the route of 
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administration* the <x2M polypeptide-antigenic molecule complexes are formulated 
accordingly as described in Section 5.7, and administered to the individual autologously 
(e.g., to treat the primary cancer or metastases thereof), or to other individuals who are in 
need of treatment for cancer of a similar tissue type, or to individuals at enhanced risk of 
cancer due to familial history or environmental risk factors. 

For example, treatment with a2M polypeptide - antigenic molecule complexes 
prepared as described above may be started any time after surgery. However, if the patient 
has received chemotherapy, a2M - antigenic molecule complexes are usually administered 
after an interval of four weeks or more so as to allow the immune system to recover. The 
therapeutic regimen may include weekly injections of the <x2M polypeptide - antigenic 
molecule complex, dissolved in saline or other physiologically compatible solution. The 
route and site of injection is varied each time, for example, the first injection is given 
subcutaneously on the left arm, the second injection on the right arm, the third injection on 
the left abdominal region, the fourth injection on the right abdominal region, the fifth 
injection on the left thigh, the sixth injection on the right thigh, etc. The same site is repeated 
after a gap of one or more injections. In addition, injections are split and each half of the 
dose is administered at a different site on the same day. Overall, the first four to six 
injections are given at weekly intervals. Subsequently, two injections are given at two-week 
intervals, followed by a regimen of injections at monthly intervals. 

Alternatively, recombinant tumor cells expressing a2M - antigenic molecule 
complexes can be used as a vaccine for injection into a patient to stimulate an immune 
response against the tumor cells or cells bearing tumor antigens. Autologous recombinant 
tumor cells stably expressing a2M polypeptide-antigenic molecule complexes are preferred. 
To determine the appropriate dose, the amount of a2M polypeptide-antigenic molecule 
complex produced by the recombinant cells is quantitated, and the number of recombinant 
cells used for vaccination is adjusted accordingly to assure a consistent level of expression in 
vivo. A preferred dose is the number of recombinant cells that can produce about 100 ng 
ct2M polypeptide per 24 hours. For the safety of the patient, the recombinant tumor cells can 
be irradiated (12000 rad) immediately prior to injection into a patient. Irradiated cells do not 
proliferate, and can continue to express a2M polypeptide-antigenic molecule complexes for 
about 7-10 days which is sufficient to induce an immune response. 

Cancers that can be treated or prevented by using covalent or noncovalent 
a2M-antigenic molecule complexes prepared by the methods of the present invention 
include, but not limited to human sarcomas and carcinomas, e.g., fibrosarcoma, 
myxosarcoma, liposarcoma, chondrosarcoma, osteogenic sarcoma, chordoma, angiosarcoma, 
endotheliosarcoma, lymphangiosarcoma, lymphangioendotheliosarcoma, synovioma, 
mesothelioma, Ewing's tumor, leiomyosarcoma, rhabdomyosarcoma, colon carcinoma, 
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pancreatic cancer, breast cancer, ovarian cancer, prostate cancer, squamous cell carcinoma, 
basal cell carcinoma, adenocarcinoma, sweat gland carcinoma, sebaceous gland carcinoma, 
papillary carcinoma, papillary adenocarcinomas, cystadenocarcinoma, medullary carcinoma, 
bronchogenic carcinoma, renal cell carcinoma, hepatoma, bile duct carcinoma, 
choriocarcinoma, seminoma, embryonal carcinoma, Wilms 1 tumor, cervical cancer, testicular 
tumor, lung carcinoma, small cell lung carcinoma, bladder carcinoma, epithelial carcinoma, 
glioma, astrocytoma, medulloblastoma^ craniopharyngioma, ependymoma, pinealoma, 
hemangioblastoma, acoustic neuroma, oligodendroglioma, meningioma, melanoma, 
neuroblastoma, retinoblastoma; leukemias, e.g., acute lymphocytic leukemia and acute 
myelocytic leukemia (myeloblastic, promyelocyte, myelomonocytic, monocytic and 
erythroleukemia); chronic leukemia (chronic myelocytic (granulocytic) leukemia and chronic 
lymphocytic leukemia); and polycythemia vera, lymphoma (Hodgkin's disease and non- 
Hodgkin's disease), multiple myeloma, Waldenstrom's macroglobulinemia, and heavy chain 



In a specific embodiment, the cancer is metastatic. In another specific embodiment, 
the patient having a cancer is immunosuppressed by reason of having undergone anti-cancer 
therapy (e.g., chemotherapy radiation) prior to administration of the a2M - antigenic 
molecule complexes of the invention. In another specific embodiment, the cancer is a tumor. 

The effect of immunotherapy with a2M polypeptide-antigenic molecule complexes 
on progression of neoplastic diseases can be monitored by any methods known to one skilled 
in the art, including but not limited to measuring: a) delayed hypersensitivity as an 
assessment of cellular immunity; b) activity of cytolytic T-lymphocytes in vitro; c) levels of 
tumor specific antigens, e.g., carcinoembryonic (CEA) antigens; d) changes in the 
morphology of tumors using techniques such as a computed tomographic (CT) scan; e) 
changes in levels of putative biomarkers of risk for a particular cancer in individuals at high 
risk, and f) changes in the morphology of tumors using a sonogram. Other techniques that 
can also be used include scintigraphy and endoscopy. 

The preventive effect of immunotherapy using a2M polypeptide-antigenic molecule 
complexes may also be estimated by determining levels of a putative biomarker for risk of a 
specific cancer. For example, in individuals at enhanced risk for prostate cancer, serum 
prostate-specific antigen (PSA) is measured by the procedure described by Brawer et al 9 
1992, J. Urol. 147:841-845, and Catalona et al 9 1993, JAMA 270:948-958; or in individuals 
at risk for colorectal cancer, CEA is measured by methods known in the art; and in 
individuals at enhanced risk for breast cancer, 16-a-hydroxylation of estradiol is measured by 
the procedure described by Schneider et al., 1982, Proc. Natl. Acad. Sci. USA 79:3047-305 1 . 
The references cited above are incorporated by reference herein in their entirety. 



disease. 
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5.3.3 COMBINATION WITH ADOPTIVE IMMUNOTHERAPY 

Adoptive immunotherapy refers to a therapeutic approach for treating infectious 
diseases or cancer in which immune cells are administered to a host with the aim that the 
cells mediate specific immunity, either directly or indirectly, to the infected cells or tumor 

5 cells and/or antigenic components, and result in treatment of the infectious disease or 
regression of the tumor, as the case may be (see U.S. Patent Application Serial No. 
08/527,546, filed September 13, 1995, which is incorporated by reference herein in its 
entirety). a2M polypeptides may be used to sensitize antigen presenting cells (APCs) using 
in covalent or noncovalent complexes with antigenic (or immunogenic) molecules, for 

10 adoptive immunotherapy. 

According to the invention, therapy by administration of a2M polypeptide-antigenic 
molecule complexes, using any desired route of administration, is combined with adoptive 
immunotherapy using APC sensitized with <x2M polypeptide-antigenic molecule complexes. 
The o2M polypeptide-antigenic molecule complex-sensitized APC can be administered 

15 concurrently with a2M polypeptide-antigenic molecule Complexes, or before or after 

administration of a2M polypeptide- antigenic molecule complexes. Furthermore, the mode 
of administration can be varied, including but not limited to, e.g., subcutaneously, 
intravenously, intraperitoneally, intramuscularly, intradermally or mucosally. 

20 5.3.3.1 SENSITIZATION OF ANTIGEN PRESENTING CELLS WITH a2M 

COMPLEXES 

The antigen-presenting cells, including but not limited to macrophages, dendritic 
cells and B-cells, are preferably obtained by production in vitro from stem and progenitor 
cells from human peripheral blood or bone marrow as described by Inaba et al y 1992, J. Exp. 

25 Med. 176:1693-1702. APC can be obtained by any of various methods known in the art. In 
a preferred aspect human macrophages are used, obtained from human blood cells. 

By way of example, but not limitation, macrophages can be obtained as follows: 
Mononuclear cells are isolated from peripheral blood of a patient (preferably the patient to be 
treated), by Ficoll-Hypaque gradient centrifugation and are seeded on tissue culture dishes 

30 which are pre-coated with the patient's own serum or with other AB+ human serum. The 
cells are incubated at 37 °C for 1 hr, then non-adherent cells are removed by pipetting. To 
the adherent cells left in the dish, is added cold (4°C) 1 mM EDTA in phosphate-buffered 
saline and the dishes are left at room temperature for 15 minutes. The cells are harvested, 
washed with RPMI buffer and suspended in RPMI buffer. Increased numbers of 

35 macrophages may be obtained by incubating at 37°C with macrophage-colony stimulating 
factor (M-CSF); increased numbers of dendritic cells may be obtained by incubating with 
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granulocyte-macrophage-colony stimulating factor (GM-CSF) as described in detail by 
Inaba, K., et al y 1992, J. Exp. Med. 176:1693-1702. 

APC are sensitized with a2M polypeptides covalently or noncovalently bound to 
antigenic molecules by incubating the cells in vitro with the complexes. The APC are 
sensitized with complexes of <x2M polypeptide and antigenic molecules preferably by 
incubating in vitro with the a2M polypeptide-complex at 37 °C for 15 minutes to 24 hours. 
By way of example but not limitation, 4xl0 7 macrophages can be incubated with 10 
microgram a2M-antigenic molecule complexes per ml or 100 microgram a2M-antigenic 
molecule complexes per ml at 37 °C for 15 mins to 24 hrs in 1 ml plain RPMI medium. The 
■ cells are washed three times and resuspended in a physiological medium preferably sterile, at 
a convenient concentration (e.g. 9 lxl0 7 /ml) for injection in a patient. In a preferred 
embodiment, the antigen presenting cells are autologous to the patient, that is, the patient into 
which the sensitized APCs are injected is the patient from which the APC were originally 
isolated,. 

Optionally, the ability of sensitized APC to stimulate, for example, the antigen- 
specific, class Restricted cytotoxic T-lymphocytes (CTL) can be monitored by their ability 
to stimulate CTLs to release tumor necrosis factor, and by their ability to act as targets of 
such CTLs. 

5.3.3.2 REINFUSION OF SENSITIZED APC 

20 

The <x2M polypeptide-antigen-sensitized APC are reinfiised into the patient 
systemically, preferably intravenously, by conventional clinical procedures. These activated 
cells are reinfused, preferentially by systemic administration into the autologous patient. 
Patients generally receive from about 10 6 to about 10 i2 sensitized macrophages, depending 
25 on the condition of the patient. In some regimens, patients may optionally receive in 

addition a suitable dosage of a biological response modifier including but not limited to the 
cytokines IFN-a, IFN-y, IL-2, IL-4, IL-6, TNF or other cytokine growth factor. 

5.3.4 DETERMINATION OF IMMUNOGENICITY OF a2M- ANTIGEN 
MOLECULE COMPLEXES 

30 

In an optional procedure, the purified a2M polypeptide-antigenic molecule 
complexes can be assayed for immunogenicity using the mixed lymphocyte target culture 
assay (MLTC) well known in the art. 

By way of example but not limitation, the following procedure can be used. Briefly, 
35 mice are injected subcutaneously with the candidate a2M polypeptide-antigenic molecule 
complexes. As a positive control another set of mice are immunized with whole cancer 



-47- 



WO 01/91787 



PCTYUS01/18047 



cells of the type from which the a2M polypeptides are derived. As a negative control, mice 
are injected with either a2M - antigenic molecule complexes isolated from normal, non- 
recombinant cells or whole cells (z.e., antigenically distinct from the type of cell from which 
the a2M polypeptides are derived). The mice are injected twice, 7-10 days apart. The mice 
are injected twice, 7-10 days apart. Ten days after the last immunization, the spleens are 
removed and the lymphocytes released. The released lymphocytes may be restimulated 
subsequently in vitro by the addition of dead cells that expressed the complex of interest. 

For example, 8xl0 6 immune spleen ceils may be stimulated with 4x1 0 4 mitomycin C 
. treated or y-irradiated (5-10,000 rads) pathogen-infected cells (or cells transfected with a 
gene encoding an antigen of the infectious agent, as the case may be), or tumor cells in 3ml 
RPMI medium containing 10% fetal calf serum. In certain cases 33% secondary mixed 
lymphocyte culture supernatant or interleukin 2 (IL-2) may be included in the culture 
medium as a source of T cell growth factors (See, Glasebrook et al. 9 1980, J. Exp. Med 
151 :876). To test the primary cytotoxic T cell response after immunization, spleen cells may 
be cultured without stimulation. In some experiments spleen cells of the immunized mice 
may also be restimulated with antigenically distinct cells, to determine the specificity of the 
cytotoxic T cell response. 

Six days later the cultures are tested for cytotoxicity in a 4 hour 51 Cr-release assay 
(See, Palladino etal, 1987, Cancer Res. 47:5074-5079 andBlachere, at ai., 1993, J. 
Immunotherapy 14:352-356). In this assay, the mixed lymphocyte culture is added to a 
target cell suspension to give different effectontarget (E:T) ratios (usually 1:1 to 40:1). The 
target cells are prelabelled by incubating lxlO 6 target cells in culture medium containing 200 
mCi 51 Cr/ml for one hour at 37°C. The cells are washed three times following labeling. 
Each assay point (E:T ratio) is performed in triplicate and the appropriate controls 
incorporated to measure spontaneous 5I Cr release (no lymphocytes added to assay) and 100% 
release (cells lysed with detergent). After incubating the cell mixtures for 4 hours, the cells 
are pelleted by centrifixgation at 200g for 5 minutes. The amount of 51 Cr released into the 
supernatant is measured by a gamma counter. The percent cytotoxicity is measured as cpm 
in the test sample minus spontaneously released cpm divided by the total detergent released 
cpm minus spontaneously released cpm. 

In order to block the MHC class I cascade a concentrated hybridoma supernatant 
derived from K-44 hybridoma cells (an anti-MHC class I hybridoma) is added to the test 
samples to a final concentration of 12.5% (Heike et aL 9 1994, J. Immunotherapy 15: 165- 
174). 

An alternative to the chromium-release assay is the ELISPOT assay which measures 
cytokine release by cytotoxic T cells in vitro after stimulation with specific antigen. 
Cytokine release is detected by antibodies which are specific for a particular cytokine, such 
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as interleukin-2, tumor necrosis factor a or interferon-y (for example, see Scheibenbogen et 
al 9 1997, Int. J. Cancer, 71:932-936). The assay is carried out in a microtiter plate which has 
been pre-coated with an antibody specific for a cytokine of interest which captures the 
cytokine secreted by T cells. After incubation of T cells for 24-48 hours in the coated wells, 
the cytotoxic T cells are removed and replaced with a second labeled antibody that 
recognizes a different epitope on the cytokine. After extensive washing to remove unbound 
antibody, an enzyme substrate which produces a colored reaction product is added to the 
plate. The number of cytokme-producing cells is counted under a microscope. This method 
has the advantages of short assay time, and sensitivity without the need of a large number of 
cytotoxic T cells. 

5.3.5 MONITORING OF EFFECTS DURING IMMUNOTHERAPY 

The effect of immunotherapy with ct2M polypeptide-antigenic molecule complexes 
can be monitored by any methods known to one skilled in the art, including but not limited to 

15 measuring: a) delayed hypersensitivity as an assessment of cellular immunity; b) activity of 
cytolytic T-lymphocytes in vitro; c) levels of infective agent-agent or tumor-specific 
antigens, e.g., carcinoembryonic (CEA) antigens. In the case of the use of ct2M - antigenic 
molecule complexes for prevention or treatment of cancer, the effect can additionally be 
monitored by: d) changes in the morphology of tumors using techniques such as a computed 

20 tomographic (CT) scan; and e) changes in levels of putative biomarkers of risk for a 
particular cancer in individuals at high risk, and f) changes in the morphology of tumors 
using a sonogram. 



immunocompetence and cellular immunity to an antigen. Inability to react to a battery of 
common skin antigens is termed anergy (Sato et aL, 1995, Clin. Immunol. Pathol. 74:35-43). 

Proper technique of skin testing requires that the antigens be stored sterile at 4°C, 
protected from light and reconstituted shorted before use. A 25- or 27-gauge needle ensures 
30 intradermal, rather than subcutaneous, administration of antigen. Twenty-four and 48 hours 
after intradermal administration of the antigen, the largest dimensions of both erythema and 
induration are measured with a ruler. Hypoactivity to any given antigen or group of antigens 
is confirmed by testing with higher concentrations of antigen or, in ambiguous 
circumstances, by a repeat test with an intermediate test. 



25 



5.3.5.1 DELAYED HYPERSENSITIVITY SKIN TEST 
Delayed hypersensitivity skin tests are of great value in the overall 



35 
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5.3.5.2 IN VITRO ACTIVATION OF CYTOTOXIC T CELLS 

The activity of cytotoxic T-lymphocytes can be assessed in vitro using the following 
method. Eight x 10 6 peripheral blood-derived T lymphocytes isolated by the Ficoll-Hypaque 
centrifugation gradient technique, are restimulated with 4x1 0 4 mitomycinC-treated tumor 

5 cells in 3ml RPMI medium containing 10% fetal calf serum. In some experiments, 33% 
secondary mixed lymphocyte culture supernatant or IL-2, is included in the culture medium 
as a source of T cell growth factors. 

In order to measure the primary response of cytolytic T-lymphocytes after 
immunization, T cells are cultured without the stimulator tumor cells. In other experiments, 

10 T cells are restimulated with antigenically distinct cells. After six days, the cultures are 
tested for cytotoxicity in a 4 hour 51 Cr-release assay. The spontaneous 51 Cr-release of the 
targets should reach a level less than 20%. For the anti-MHC class I blocking activity, a 
tenfold concentrated supernatant of W6/32 hybridoma is added to the test at a final 
concentration of 12.5% (Heike M., et al., 1 Immunotherapy 15:165-174). 

15 

5.3.5.3 LEVELS OF TUMOR SPECIFIC ANTIGENS 

Although it may not be possible to detect unique tumor antigens on all tumors, many 
tumors display antigens that distinguish them from normal cells. The monoclonal antibody 
reagents have permitted the isolation and biochemical characterization of the antigens and 

20 have been invaluable diagnostically for distinction of transformed from nontransformed cells 
and for definition of the cell lineage of transformed cells. The best-characterized human 
tumor-associated antigens are the oncofetal antigens. These antigens are expressed during 
embryogenesis, but are absent or very difficult to detect in normal adult tissue. The 
prototype antigen is carcinoembryonic antigen (CEA), a glycoprotein found on fetal gut and 

25 human colon cancer cells, but not on normal adult colon cells. Since CEA is shed from 
colon carcinoma cells and found in the serum, it was originally thought that the presence of 
this antigen in the serum could be used to screen patients for colon cancer. However, 
patients with other tumors, such as pancreatic and breast cancer, also have elevated serum 
levels of CEA. Therefore, monitoring the fall and rise of CEA levels in cancer patients 

30 undergoing therapy has proven useful for predicting tumor progression and responses to 
treatment. 

Several other oncofetal antigens have been useful for diagnosing and monitoring 
human tumors, e.g., alpha-fetoprotein, an alpha-globulin normally secreted by fetal liver and 
yolk sac cells, is found in the serum of patients with liver and germinal cell tumors and can 
35 be used as a marker of disease status. 
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5.3.5.4 COMPUTED TOMOGRAPHIC (CT) SCAN 

CT remains the choice of techniques for the accurate staging of cancers. CT has 
proved more sensitive and specific than any other imaging techniques for the detection of 
metastases. A sonogram remains an alternative choice of technique for the accurate staging 
of cancers. 

5.3.5.5 MEASUREMENT OF PUTATIVE BIOMARKERS 

The levels of a putative biomarker for risk of a specific cancer are measured to 
monitor the effect of a2M covalently or noncovalently bound to antigenic molecule 
complexes. For example, in individuals at enhanced risk for prostate cancer, serum prostate- 
specific antigen (PSA) is measured by the procedure described by Brawer et. aL, 1992, J. 
Urol. 147:841-845, and Catalona et aL, 1993, JAMA 270:948-958; and in individuals at 
enhanced risk for breast cancer, 1 6-a-hydroxylation of estradiol is measured by the procedure 
described by Schneider et aL, 1982, Proc. Natl. Acad. Sci. ISA 79:3047-3051. 

15 

5.4 TARGET AUTOIMMUNE DISEASES 

Autoimmune diseases that can be treated by the methods of the present invention 
include, but are not limited to, insulin dependent diabetes mellitus (i.e., IDDM, or 
autoimmune diabetes), multiple sclerosis, systemic lupus erythematosus, Sjogren's 

20 syndrome, scleroderma, polymyositis, chronic active hepatitis, mixed connective tissue 
disease, primary biliary cirrhosis, pernicious anemia, autoimmune thyroiditis, idiopathic 
Addison's disease, vitiligo, gluten-sensitive enteropathy, Graves' disease, myasthenia gravis, 
autoimmune neutropenia, idiopathic thrombocytopenia purpura, rheumatoid arthritis, 
cirrhosis, pemphigus vulgaris, autoimmune infertility, Goodpasture's disease, bullous 

25 pemphigoid, discoid lupus, ulcerative colitis, and dense deposit disease. The diseases set 
forth above, as referred to herein, include those exhibited by animal models for such 
diseases, such as, for example non-obese diabetic (NOD) mice for IDDM and experimental 
autoimmune encephalomyelitis (EAE) mice for multiple sclerosis. 

The methods of the present invention can be used to treat such autoimmune diseases 

30 by reducing or eliminating the immune response to the patient's own (self) tissue, or, 
alternatively, by reducing or eliminating a pre-existing autoimmune response directed at 
tissues or organs transplanted to replace self tissues or organs damaged by the autoimmune 
response. 

35 
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5.5 TARGET INFECTIOUS DISEASES 

The infectious diseases that can be treated or prevented using the methods and 
compositions of the present invention include thole caused by intracellular pathogens such as 
viruses, bacteria, protozoans, and intracellular parasites. Viruses include, but are not limited 

5 to viral diseases such as those caused by hepatitis type B virus, parvoviruses, such as 
adeno-associated virus and cytomegalovirus, papovaviruses such as papilloma virus, 
polyoma viruses, and SV40, adenoviruses, herpes viruses such as herpes simplex type I 
(HSV-I), herpes simplex type n (HSV-II), and Epstein-Barr virus, poxviruses, such as 
variola (smallpox) and vaccinia virus, RlfA viruses, including but not limited to human 

10 immunodeficiency virus type I (HIV-I), human immunodeficiency virus type II (HTV-II), 
human T-cell lymphotropic virus type I (HTLV-I), and human T-cell lymphotropic virus 
type II (HTLV-II); influenza virus, measles virus, rabies virus, Sendai virus, picomaviruses 
such as poliomyelitis virus, coxsackieviruses, rhinoviruses, reoviruses, togaviruses such as 
rubella virus (German measles) and Semliki forest virus, arboviruses, and hepatitis type A 

15 virus. 

In another embodiment, bacterial infections can be treated or prevented such as, but 
not limited to disorders caused by pathogenic bacteria including, but not limited to, 
Streptococcus pyogenes, Streptococcus pneumoniae, Neisseria gonorrhoea, Neisseria 
meningitidis, Corynebacterium diphtheriae, Clostridium botulinum, Clostridium perfringens, 

20 Clostridium tetani, Haemophilus influenzae, Klebsiella pneumoniae, Klebsiella ozaenae, 
Klebsiella rhinoscleromotis, Staphylococcus aureus, Vibrio cholerae, Escherichia coli, 
Pseudomonas aeruginosa, Campylobacter (Vibrio) fetus , Campylobacter jejuni, Aeromonas 
hydrophila, Bacillus cereus, Edwardsiella tarda, Yersinia enterocolitica, Yersinia pestis, 
Yersinia pseudotuberculosis, Shigella dysenteriae, Shigella jlexneri, Shigella sonnei, 

25 Salmonella typhiimurium, Salmonella typhii, Treponema pallidum, Treponema pertenue, 
Treponema carateneum, Borrelia vincentii, Borrelia burgdorferi, Leptospira 
icterohemorrhagiae, Mycobacterium tuberculosis, Toxoplasma gondii, Pneumocystis carinii, 
Francisella tularensis, Brucella abortus, Brucella suis, Brucella melitensis, Mycoplasma 
spp., Rickettsia prowazeki, Rickettsia tsutsugumushi, Chlamydia spp., and Helicobacter 

30 pylori. 

In another preferred embodiment, the methods can be used to treat or prevent 
infections caused by pathogenic protozoans such as, but not limited to, Entomoeba 
histolytica, Trichomonas tenas, Trichomonas hominis, Trichomonas vaginalis, Trypanosoma 
gambiense, Trypanosoma rhodesiense, Trypanosoma cruzi, Leishmania donovani, 
35 Leishmania tropica, Leishmania braziliensis, Pneumocystis pneumonia, Plasmodium vivax, 
Plasmodium falciparum, and Plasmodium malaria. 
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5.6 TARGET PROLIFERATIVE CELL DISORDERS 

With respect to specific proliferative and oncogenic disease associated with a2M- 

5 a2M activity, the diseases that can be treated or prevented by the methods of the present 
invention include, but are not limited to: human sarcomas and carcinomas, e.g., 
fibrosarcoma, myxosarcoma, liposarcoma, chondrosarcoma, osteogenic sarcoma, chordoma, 
angiosarcoma, endotheliosarcoma, lymphangiosarcoma, lymphangjoendotheliosarcoma, 
synovioma, mesothelioma, Ewing's tumor, leiomyosarcoma, rhabdomyosarcoma, colon 

10 carcinoma, pancreatic cancer, breast cancer, ovarian cancer, prostate cancer, squamous cell 
carcinoma, basal cell carcinoma, adenocarcinoma, sweat gland carcinoma, sebaceous gland 
carcinoma, papillary carcinoma, papillary adenocarcinomas, cystadenocarcinoma, medullary 
carcinoma, bronchogenic carcinoma, renal cell carcinoma, hepatoma, bile duct carcinoma, 
choriocarcinoma, seminoma, embryonal carcinoma, Wilms 1 tumor, cervical cancer, testicular 

15 tumor, lung carcinoma, small cell lung carcinoma, bladder carcinoma, epithelial carcinoma, 
glioma, astrocytoma, medulloblastoma, craniopharyngioma, ependymoma, pinealoma, 
hemangioblastoma, acoustic neuroma, oligodendroglioma, meningioma, melanoma, 
neuroblastoma, retinoblastoma; leukemias, e.g., acute lymphocytic leukemia and acute 
myelocytic leukemia (myeloblastic, promyelocyte, myelomonocytic, monocytic and 

20 erythroleukemia); chronic leukemia (chronic myelocytic (granulocytic) leukemia and chronic 
lymphocytic leukemia); and polycythemia vera, lymphoma (Hodgkin's disease and non- 
Hodgkin's disease), multiple myeloma, WaldenstrSm's macroglobulinemia, and heavy chain 
disease. 

Diseases and disorders involving a deficiency in cell proliferation or in which cell 
25 proliferation is desired for treatment or prevention, and that can be treated or prevented by 
inhibiting the a2M function, include but are not limited to degenerative disorders, growth 
deficiencies, hypoproliferative disorders, physical trauma, lesions, and wounds; for example, 
to promote wound healing, or to promote regeneration in degenerated, lesioned or injured 
tissues, etc. 

30 

5.7 DOSAGE REGIMENS AND FORMULATION 

Covalent or noncovalent complexes of a2M polypeptides and antigenic molecules of 
the invention may be formulated into pharmaceutical preparations for administration to 
mammals for treatment or prevention of infectious diseases or cancer at therapeutically 
effective doses for immunotherapy. 
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Selection of the preferred effective dose will be determined by a skilled artisan based 
upon considering several factors which will be known to one of ordinary skill in the art. 
Such factors include the particular form of a2M, and its pharmacokinetic parameters such as 

5 bioavailability, metabolism, half-life, etc. , which will have been established during the usual 
development procedures typically employed in obtaining regulatory approval for a 
pharmaceutical compound. Further factors in considering the dose include the condition or 
disease to be treated or the benefit to be achieved in a normal individual the body mass of 
the patient, the route of administration, whether administration is acute or chronic, 

10 concomitant medications, and other factors well known to affect the efficacy of administered 
pharmaceutical agents. Thus the precise dosage should be decided according to the 
judgment of the practitioner and each patient's circumstances, e.g., depending upon the 
condition and the immune status of the individual patient, according to standard clinical 
techniques. 

15 Covalent or noncovalent complexes of a2M polypeptides and antigenic molecules of 

the invention may be formulated into pharmaceutical preparations for administration to 
mammals for treatment or prevention of infectious diseases or cancer. Drug solubility and 
the site of absorption are factors which should be considered when choosing the route of 
administration of a therapeutic agent. 

20 a2M polypeptide-antigenic molecule complexes of the invention may optionally be 

administered with one or more adjuvants in order to enhance the immunological response. 
For example, depending on the host species, adjuvants which may be used include, but are 
not limited to: mineral salts or mineral gels such as aluminum hydroxide, aluminum 
phosphate, and calcium phosphate; surface active substances such as lysolecithin, pluronic 

25 polyols, polyanions, peptides, oil emulsions, keyhole limpet hemocyanins, and dinitrophenol; 
immunostimulatory molecules, such as cytokines, saponins, muramyl dipeptides and 
tripeptide derivatives, CpG dinucleotides, CpG oligonucleotides, monophosphoryl Lipid A, 
and polyphosphazenes; particulate and microparticulate adjuvant, such as emulsions, 
liposomes, virosomes, cochleates; or an immune stimulating complex mucosal adjuvants, 

30 Freund's (complete and incomplete, and potentially useful human adjuvants such as BCG 
(bacille Calmette-Guerin) and corynebacterium parvum,). 

a2M polypeptide-antigenic molecule complexes of the invention may be 
administered using any desired route of administration, including but not limited to, e.g., 
subcutaneoUsly, intravenously or intramuscularly, although intradermally or mucosally is 

35 preferred. Advantages of intradermal or mucosal administration include use of lower doses 
and rapid absorption, respectively. Mucosal routes of administration include, but are not 
limited to, oral, rectal and nasal administration. Preparations for mucosal administrations are 
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suitable in various formulations as described below. The route of administration can be 
varied during a course of treatment. 

The doses recited above are preferably given once weekly for a period of about 4-6 

5 weeks, and the mode or site of administration is preferably varied with each administration. 
In a preferred example, subcutaneous administrations are given, with each site of 
administration Varied sequentially. Thus, by way of example and not limitation, the first 
injection may be given subcutaneously on the left arm, the second on the right arm, the third 
on the left belly, the fourth on the right belly, the fifth on the left thigh, the sixth on the right 

jq thigh, etc. The same site may be repeated after a gap of one or more injections. Also, split 
injections may be given. Thus, for example, half the dose may be given in one site and the 
other half on an other site on the same day. 

Alternatively, the mode of administration is sequentially varied, e.g., weekly 
injections are given in sequence subcutaneously, intramuscularly, intravenously or 

j5 intraperitoneally. 

After 4-6 weeks, further injections are preferably given at two-week intervals over a 
period of time of one month. Later injections may be given monthly. The pace of lata: 
injections may be modified, depending upon the patient's clinical progress and 
responsiveness to the immunotherapy. 

20 Compositions comprising covalent or noncovalent complexes formulated in a 

compatible pharmaceutical carrier may be prepared, packaged, and labeled for treatment of 
the indicated infectious disease or tumor. In preferred aspects, an amount of <x2M 
polypeptide - antigenic molecule complex is administered to a human that is in the range of 
about 2 to 150 ^g, preferably 2 to 50 ng, most preferably about 25 \ig, given once weekly for 

25 about 4-6 weeks, intradermally with the site of administration varied sequentially. 

If the complex is water-soluble, then it may be formulated in an appropriate buffer, 
for example, phosphate buffered saline or other physiologically compatible solutions. 
Alternatively, if the resulting complex has poor solubility in aqueous solvents, then it may be 
formulated with a non-ionic surfactant such as Tween, or polyethylene glycol. Thus, the 

3Q covalent or noncovalent complexes and their physiologically acceptable solvates may be 
formulated for administration by inhalation or insufflation (either through the mouth or the 
nose) or oral, buccal, parenteral, rectal administration or, in the case of tumors, directly 
injected into a solid tumor. 

For oral administration, the pharmaceutical preparation may be in liquid form, for 

35 example, solutions, syrups or suspensions, or may be presented as a drug product for 

reconstitution with water or other suitable vehicle before use. Such liquid preparations may 
be prepared by conventional means with pharmaceutically acceptable additives such as 



-55- 



WO 01/91787 PCT/US01/18047 



suspending agents (e.g., sorbitol syrup, cellulose derivatives or hydrogenated edible fats); 
emulsifying agents (e.g., lecithin or acacia); non-aqueous vehicles (e.g., almond oil, oily 
esters, or fractionated vegetable oils); and preservatives (e.g., methyl or propyl-p- 

5 hydroxybenzoates or sorbic acid). The pharmaceutical compositions may take the form of, 
for example, tablets or capsules prepared by conventional means with pharmaceutical^ 
acceptable excipients such as binding agents (e.g., pregelatinized maize starch, polyvinyl 
pyrrolidone or hydroxypropyl methylcellulose); fillers (e.g., lactose, microcrystalline 
cellulose or calcium hydrogen phosphate); lubricants (e.g., magnesium stearate, talc or 

j,q silica); disintegrants (e.g., potato starch or sodium starch glycolate); or wetting agents (e.g., 
sodium lauryl sulphate). The tablets may be coated by methods well-known in the art. 

Preparations for oral administration may be suitably formulated to give controlled 
release of the complexes. Such compositions may take the form of tablets or lozenges 
formulated in conventional manner. 

1 5 For administration by inhalation, the complexes may be conveniently delivered in the 

form of an aerosol spray presentation from pressurized packs or a nebulizer, with the use of a 
suitable propellant, e.g., dichlorodifluoromethane, trichlorofluoromethane, 
dichlorotetrafluoroethane, carbon dioxide or other suitable gas. In the case of a pressurized 
aerosol the dosage unit may be determined by providing a valve to deliver a metered amount. 

20 Capsules and cartridges of, e.g., gelatin for use in an inhaler or insufflator may be formulated 
containing a powder mix of the complexes and a suitable powder base such as lactose or 
starch. 

The complexes may be formulated for parenteral administration by injection, e.g., by 
bolus injection or continuous infusion. Formulations for injection may be presented in unit 

25 dosage form, e.g., in ampoules or in multi-dose containers, with an added preservative. The 
compositions may take such forms as suspensions, solutions or emulsions in oily or aqueous 
vehicles, and may contain formulatory agents such as suspending, stabilizing and/or 
dispersing agents. Alternatively, the active ingredient may be in powder form for 
constitution with a suitable vehicle, e.g., sterile pyrogen-free water, before use. 

20 The complexes may also be formulated in rectal compositions such as suppositories 

or retention enemas, e.g., containing conventional suppository bases such as cocoa butter or 
other glycerides. 

In addition to the formulations described previously, the complexes may also be 
formulated as a depot preparation. Such long acting formulations may be administered by 
35 implantation (for example, subcutaneously or intramuscularly) or by intramuscular injection. 
Thus, for example, the complexes may be formulated with suitable polymeric or hydrophobic 
materials (for example, as an emulsion in an acceptable oil) or ion exchange resins, or as 
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sparingly soluble derivatives, for example, as a sparingly soluble salt. Liposomes and 
emulsions are well known examples of delivery vehicles or carriers for hydrophilic drugs. 

The complexes may, if desired, be presented in a pack or dispenser device which may 

5 contain one or more unit dosage forms containing the covalent or noncovalent complexes. 
The pack may for example comprise metal or plastic foil, such as a blister pack. The pack or 
dispenser device may be accompanied by instructions for administration. 

The invention also provides kits for carrying out the therapeutic regimens of the 
invention. Such kits comprise in one or more containers therapeutically or prophylactically 

JO effective amounts of the covalent or noncovalent a2M polypeptide - antigenic molecule 
complexes in pharmaceutically acceptable form. The a2M polypeptide - antigenic molecule 
complexes in a vial of a kit of the invention may be in the form of a pharmaceutically 
acceptable solution, e.g., in combination with sterile saline, dextrose solution, or buffered 
solution, or other phaimaceutically acceptable sterile fluid. Alternatively, the complex may 

j 5 be lyophilized or desiccated; in this instance, the kit optionally further comprises in a 
container a pharmaceutically acceptable solution saline, dextrose solution, etc.), 
preferably sterile, to reconstitute the complex to form a solution for injection purposes. 

In another embodiment, a kit of the invention further comprises a needle or syringe, 
preferably packaged in sterile form, for injecting the complex, and/or a packaged alcohol 

2Q pad. Instructions are optionally included for administration of a2M polypeptide - antigenic 
molecule complexes by a clinician or by the patient. 



6. EXAMPLE: a2M ANTAGONIZES HSP-MEBIATEB ANTIGEN 
PRESENTATION VIA THE a2M RECEPTOR 

25 6.1 INTRODUCTION 

The Example presented herein describes the successful identification of an interaction 
between gp96 and the a2M receptor present in macrophages and dendritic cells in vivo, and 
the blocking of this interaction by a2M. The experiments presented herein form the basis for 
the compositions and therapeutic methods of the present invention which relate to the use of 
a2M polypeptide-antigenic molecule complexes as immunotherapeutic agents for treatment 
of immune disorders, proliferative disorders, and infectious diseases. 

The Applicant of the present invention noted that certain observations were 
inconsistent with a "direct transfer" model of HSP-chaperoned peptide antigen presentation. 
3 5 First, the immunogenicity of HSP preparations is dependent on the presence of functional 
phagocytic cells but not B cells or other nonprofessional antigen-presenting cells (Udono and 
Srivastava, 1993, supra; Suto and Srivastava, 1995, supra), whereas free peptides can 
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sensitize all cell types. Second, extremely small quantities of HSP-peptide complexes were 
effective in eliciting specific immunity, z.e., gp96-chaperoned peptides are several hundred 
times as effective as free peptides in sensitizing macrophages for CTTL* recognition, 

5 suggesting the possibility of a specific uptake mechanism. Third, gp96-chaperoned peptides 
elicited an MHC I response that was not limited by the size of peptide. Finally, the 
processing of gp96-peptide complexes in macrophage was found to be sensitive to Brefeldin 
A (BFA), which blocks transport through the Golgi apparatus, suggesting that processing 
occurred through an intercellular mechanism. These observations led to the hypothesis that 

jq HSP-chaperoned peptides may be processed internally and re-presented by MHC class I 
molecules on the cell surfaces of macrophages (Suto and Srivastava, 1995, supra). There is 
also the hypothesis that the mannose receptor is used in the uptake of gp96 but no 
mechanism has been proposed for the non-glycosylated HSPs, such as HSP70 (Ciupitu et aL, 
1998, J. Exp. Med., 187: 685-691). Others suggested that anovel intracellular trafficking 

j 5 pathway may be involved for the transport of peptides from the extracellular medium into the 
lumen of ER) Day etal, 1997, Proc. Natl. Acad. Sci. 94:8065-8069; Nicchitta, 1998, Curr. 
Opin. in Immunol. 10:103-109). Further suggestions include the involvement of phagocytes 
which (a) possess an ill-defined pathway to shunt protein from the phagosome into 
thecytosol where it would enter the normal class I pathway; (b) digest ingested material in 

20 lysosomes and regurgitate peptides for loading on the surface to class I molecules (Bevan, 
1995, J. Exp. Med. 192:639-41). The discovery of a receptor for heat shock protein as 
disclosed herein helps to resolve the paradox of how extracellular antigenic peptides 
complexed to HSPs can be presented by MHC class I molecules on antigen presenting cells. 

25 6.2 MATERIALS AND METHODS 

Affinity chromatography. Proteins (lmg) in 2ml volume were incubated with 2ml of 
equilibrated AmmoLink beads in PBS with a reductant (NaCNBH 3 ) for 1 hour. Uncoupled 
protein was removed by extensive washing of the column and unreactive groups quenched. 
Immobilization yields were typically >92% of the starting amount of protein. Columns were 

30 stored at 4°C until used. Such columns were made with gp96 (purified as described in 
Srivastava et al 9 1986, Proc. Natl Acad. Sci., U.S.A. 83:3407-341 1) and albumin. For 
membrane purification, cells were lysed by dounce homogenization in hypotonic buffer 
containing PMSF. Unlyzed cells and nuclei were removed by centrifugation at lOOOg for 5 
mm. The postnuclear supernatant was centrifuged at 100,000g for 90 mins. The pellet 

35 contains total membranes and was fractionated by aqueous two-phase partition with a 
dextran/polyethylene glycol biphase. Briefly membranes were resuspended in PEG (33% 
wt/wt in 0.22 M sodium phosphate buffer, pH 6.5) and underlaid gently with dextran 
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(20%wt/wt in 0.22M sodium phosphate buffer, pH 6.5). The two phases were mixed gently 
and centrifuged at 2000 g for 15 mins. The white material at the interphase was enriched for 
plasma membranes, whose proteins were extracted by 2 hr incubation in 20mM Tris buffer 

5 (pH8, containing 0.08% octylglucoside) at 4°C. 

Photo cross-linking of gp96 to putative receptor. The cross-linker (SASD, (Pierce) 
was labeled with I 125 using iodobeads (Pierce). Radiolabeled SASD was covalently attached 
to gp96 by incubation at room temperature for 1 hr. Free SASD and I 125 were removed by 
size exclusion column (KwikSep columns, Pierce). For cross-linking studies, I 125 -SASD- 

10 §P 96 ^8 SP 9 ^) was incubated with purified CD1 lb + cells. Unbound protein was 
removed by washing. All procedures to this point were carried out in very dim light. 
Proteins were cross-linked with UV light Cells were lysed with lysis buffer (0.5%NP4O, 
lOmM Tris, ImMEDTA, 150mM NaCl) and treated with 100 mM 2-mercaptoethanol to 
cleave the cross-linker. Cell ly sates were analyzed by SDS-PAGE and autoradiography. 

j 5 Re-presentation assays. Re-presentation assays were carried out as described (Suto 

and Srivastava, 1995, Science 269:1585-1588). Antigen presenting cells (RAW264.7 
macrophage cell line) were plated at a 1 : 1 ratio with AH I -specific T cells in complete 
RPML Approximately 10,000 cells of each type were used. Gp96 (10 ng/ml) chaperoning 
the AH1-20 mer peptide (RVTYHSPSYVYHQFERRAK) was added to the cells and the 

2Q entire culture was incubated for 20 hrs. Stimulation of T cells was measured by quantifying 
the amount of IFN-y released into the supernatants by ELISA (Endogen). 

Protein Microsequencing. Proteins identified by affinity chromatography were 
analyzed on SDS-PAGE and stained with coomasie blue or transferred onto PVDF 
membrane and stained with coomasie blue^ all of it under keratin-free conditions. Protein 

25 bands were excised with a razor from the gel or membrane. Tryptic peptides from an 80kDa 
faint coomassie band were extracted by 50% acetonitrile, 5% formic acid, dried, and loaded 
onto a 75 m 10 cm, reverse-phase CI 8, microcapillary column (3 ^1 vol) and tryptic peptides 
were separated by on-line microcapillary liquid chromatography-tendem mass spectrometry 
followed by database searching using the SEQUEST program as previously described. 

30 (Gatlin et a/.,2000, Anal. Chem. 72:757-63; Link et al. 9 1999, Nat. Biotechnol. 17:676-82). 
The analysis was carried out in a data-dependent auto-MS/MS fashion using a Finnigan LCQ 
iontrap Mass Spectrometer. 



35 
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6.3 RESULTS 

Identification of an 80 kDa protein as a potential gp96 receptor. Homogenous 
preparations of gp96 were coupled to FITC and the gp96-FITC was used to stain RAW264.7 

5 cells, shown to be functionally capable of re-presenting gp96-chaperoned peptides. Gp96- 
FITC but not control albumin-FITC preparations stained the cell surface of RAW264.7 cells 
(FIG. 1 A). Plasma membrane preparations of cell surface-biotinylated RAW264.7 cells were 
solubilized in 0.08% octyl-glucoside and the soluble extract was applied to a gp96-Sepharose 
column. The bound proteins were eluted with 3M sodium chloride. SDS-PAGE analysis of 
the eluate showed 2 major bands of -75-80 kDa size (FIG. IB, top left). Blotting of this gel 
with avidin-peroxidase showed that both bands were biotinylated, indicating their surface 
localization (FIG. IB, bottom left). Affinity purification of membrane extracts of RAW264.7 
cells over control serum albumin affinity columns did not result in isolation of any proteins, 
nor did probing of immunoblots of such gels with avidin peroxidase detect any albumin- 

15 binding surface proteins (FIG. IB, top and bottom center lanes). As an additional control, 
chromatography of membrane extracts of P815 cells which do not bind gp96-FTTC and 
which do not re-present gp96-chaperoned peptides, on gp96 affinity columns did not result in 
elution of any gp96-binding proteins (FIG. IB, top and bottom right lanes). 

In parallel experiments, gp96 molecules were coupled to the radio-iodinated linker 

20 sulfosuccinimidyl (4-azidosalicylamido) hexanoate (SASD) which contains a photo cross- 
linkable group. Gp96-SASD-I 125 was pulsed onto peritoneal macrophages, which have been 
shown previously to re-present gp96-chaperoned peptides (Suto and Srivastava, 1995, 
Science 269:1585-1588). Excess gp96-SASD was removed by multiple rounds of washing 
of the cells and photoactivation was carried out by exposure of cells to UV light for 10 mm. 

25 Cell Iysates were reduced in order to transfer the I 125 group to the putative gp96 ligand and 
were analyzed by SDS- PAGE followed by autoradiography. The gp96 molecule was 
observed to cross-link to an -80 kDa band specifically present in re-presentation-competent 
macrophage but not in the re-presentation-incompetent P815 cells (FIG. 1C). This band 
appears to correspond in size to the larger of the two bands seen in eluates of gp96 affinity 

30 columns (FIG. ID). No band corresponding to the lower band in that preparation is seen in 
the photo cross-linked preparation. The observation of a specific binding of gp96 to an 80 
kDa protein in two different re-presentation-competent cell types, but not in a re- 
presentation-incompetent cell line, and by two independent assays supported the candidacy 
of the 80 kDa molecule for the gp96 receptor. 

35 Antiserum against the 80 kDa protein inhibits re-presentation of a gp96-chaperoned 

antigenic peptide. The eluates containing the 75-80 kDa proteins were used to immunize a 
New Zealand white rabbit, and pre-immune and immune sera were used to probe blots of 



-60- 



WO 01/91787 PCT/US01/18047 



plasma membrane extracts of the re-presentation-competent RAW264.7 and primary 
peritoneal macrophages and the re-presentation-incompetent P815 cells. The immune but not 
the pre-immune serum detected the 80 kDa band (and a faint lower 75 kDa band) in plasma 

5 membrane extracts of primary macrophage and the RAW264.7 membranes but not of P8 1 5 
cells (FIG. 2A). The pre-immune and immune sera were tested in a functional assay for their 
ability to block re-presentation of gp96-chaperoned peptides. The L d -restricted epitope AH1 
derived from the gp70 antigen of murine colon carcinoma CT26 (Huang et aL, 1996, Proc. 
Natl Acad. Sci. U.S.A. 93:9730-9735) was used as the model system. Complexes of gp96 

10 with an AH1 precursor (used to inhibit direct presentation) were pulsed onto RAW264.7 
cells which were used to stimulate a L d /AHl-specific CD8+ T cell clone. Release of 
interferon-y by the clones was measured as a marker of their activation. RAW264.7 cells 
were able to re-present gp96-chaperoned AH1 precursor effectively in this assay. It was 
observed that at the highest concentration, the immune sera inhibited re-presentation 

15 completely (FIG. 2B). Although the pre-immune serum was ineffective in inhibiting re- 
presentation as compared to the immune sera, it did inhibit re-presentation significantly at 
higher concentrations. The significance of tins observation became clear later when we 
determined the identity of the gp96 receptor. Repeated immunizations with the affinity- 
purified gp96-binding proteins did not result in corresponding increase in antibody titers. 

20 Identification of the 80 kDa protein as an amino terminal fragment of the heavy chain 

of the a2M receptor. The 80 kDa protein eluted from the gp96 affinity column was resolved 
on SDS-PAGE and visualized by staining with Coomassie Brilliant Blue. The protein band 
was subjected to in-gel trypsin digestion and mass spectrometry-based protein 
microsequencing as described in the methods in Section 6.2. Four independent tryptic 

25 peptides corresponding to N-teiminal region of the <x2-macroglobulin (a2M) receptor, 
designated by immunologists as CD91, were identified (FIG. 3C). 

a2M inhibits re-presentation of a gp96-chaperoned antigenic peptide by RAW264, 7, 
<x2M receptor is one of the known natural ligands for the a2M receptor. Its ability to inhibit 
re-presentation of gp96-chaperoned antigenic peptide AH1 was tested in the assay described 

30 in FIG. 2. a2M but not control proteins selectin (CD62) or serum albumin was observed to 
inhibit re-presentation completely and titratably (FIG. 4). This observation was also 
consistent with the result in FIG. 2 that while the pre-immune serum did not detect an 80 
kDa band in plasma membranes of RAW264.7 cells, it did inhibit re-presentation to some 
degree at high concentrations. Thus, by structural as well as functional criteria, the a2M 

35 receptor was determined to fulfill the criteria essential for a receptor for gp96. 
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6.4 DISCUSSION 

The a2M receptor, which is also designated CD91, was initially identified as a 
protein related to the low density lipoprotein (LDL) receptor Related Protein (LRP) 

5 (Slrickland et al, 1990, J. Biol. Chem.265: 17401-17404; Kristensen etal, 1990, FEBS Lett. 
276:151-155). The protein consists of an -420 kDa a subunit, an 85 kDa p subunit and a 39 
kDa tightly associated molecule (RAP). The a and P subunits are encoded by a single 
transcript of -45 Kb in size (Van Leuven et al, 1993, Biochim. Biophys. Acta. 1173:71-74. 
The receptor has been shown to be present in cells of the monocytic lineage and in 

IQ hepatocytes, fibroblasts and keratinocytes. CD91 has been shown previously to bind the 
activated form of the plasma glycoprotein a2M, which binds to and inhibits a wide variety of 
endoproteinases. a2M receptor also binds to other ligands such as transforming growth 
factor P (O Connor-McCourt etal, 1987, J. Biol. Chem. 262:14090-14099), platelet-derived 
growth factor (Huang et al, 1984, Proc. Natl. Acad. Sci. U.S.A. 81:342-346), and fibroblast 

15 growth factor pennis et al, 1989, J. Biol. Chem. 264:721 0-7216). a2M is thus believed to 
regulate, and specifically diminish, the activities of its various ligands. Complexed with 
these various ligands, o2M binds a2M receptor on the cell surface and is internalized through 
receptor-mediated endocytosis. Uptake of a2M-complexed ligands has been assumed thus 
far to be the primary function of the a2M receptor, although a role for it in lipid metabolism 

20 is also assumed. a2M receptor ligands other than <x2M, such as tissue-specific plasminogen 
activator-inhibitor complex (Orth et al, 1992, Proc. Natl. Acad. Sci. U.S.A. 89:7422-7426) 
and urokinase-PAIl complex (Nykjaer et al, 1992, J. Biol. Chem. 267:14543-14546), have 
been identified. These ligands attest to a role for a2M receptor in clearing a range of 
extracellular, plasma products. 

25 The studies reported here show that the heat shock protein gp96 is an additional 

ligand for the a2M receptor. The human gp96-coding gene has been mapped previously by 
us at chromosome 12 (q24.2-+q24.3) (Maki et al, 1993, Somatic Cell Mol. Gen. 19:73-81). 
It is of interest in this regard that the a2M receptor gene has been mapped to the same 
chromosome and at a not too distant location (ql3-+ql4) (Hilliker et al Genomics 13:472- 

30 474). Gp96 binds a2M receptor directly and not through other ligands such as a2M. 

Homogenous preparations of gp96, in solution, or cross-linked to a solid matrix, bind to the 
a2M receptor. Indeed, the major ligand for the a2M receptor, <x2M, actually inhibits 
interaction of gp96 with a2M receptor, instead of promoting it, providing evidence that gp96 
is a direct ligand for the <x2M receptor. The 80 kDa protein shown to bind gp96 is clearly an 

35 amino terminal degradation product of the a subunit of the a2M receptor. Degradation 
products of the a2M receptor in this size range have also been observed in previous studies 
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(Jensen etal, 1989, Biochem. Arch. 5 :171-176), and may indicate the existence of a discrete 
ectodomain in the a2M receptor which may be particularly sensitive to proteolytic cleavage. 
. As shown here, the gp96- <x2M receptor interaction provides a new type of function 

5 for a2M receptor, a function of a sensor, not only of the extracellular environment with its 
previously known plasma-based ligands, but also a sensor of the intracellular milieu as well. 
HSPs such as gp96 are obligate intracellular molecules and are released into the extracellular 
milieu only under conditions of necrotic (but not apoptotic) cell death. Thus, the a2M 
receptor may act as a sensor for necrotic cell death (see FIG. 5), just as the scavenger 

10 receptor CD36 and the recently identified phosphatidyl serine-binding protein act as sensors 
of apoptotic cell death and receptors for apoptotic cells (Savili et aL 9 1992, J. Clin. 
Invest.90:1513-1522; Fadok et al. 9 2000, Nature 405:85-90). Interaction of the macrophages 
with the apoptotic cells leads to a down-regulation of the inflammatory cytokines such as 
TNF (Fadok et aL, 2000, supra), while gp96-APC interaction leads to re-presentation of 

15 gp96-chaperoned peptides by MHC I molecules of the APC, followed by stimulation of 
antigen-specific T cells (Suto and Srivastava, 1995, supra) and, in addition, secretion of pro- 
inflammatory cytokines such as TNF, GM-CSF and BU12. Interestingly, a2M, an 
independent figand for the a2M receptor, inhibits representation of gp96-chaperoned 
peptides by macrophages. This observation suggests that re-presentation of gp96-chaperoned 

20 peptides can not occur physiologically in blood, but only within tissues as a result of 

localized necrotic cell death. This is consistent with the complete absence of gp96 or other 
HSPs in blood under all conditions tested. Together, these observations point towards a 
possible mechanism whereby the release of HSPs in the blood as a result of severe tissue 
injury and lysis will not lead to a systemic and lethal pro-inflammatory cytokine cascade. 

25 It is possible, therefore, that the a2M receptor renders it possible for the APCs to 

sample (i) the extracellular milieu of the blood through <x2M and other plasma ligands and 
(ii) the intracellular milieu of the tissues through HSPs, particularly of the gp96 family. The 
former permits APCs to implement their primordial phagocytic function, while the latter 
allows them to execute its innate and adaptive immunological functions. Viewed in another 

30 perspective, recognition of apoptotic cells by APCs through CD36 or phophatidyl serine, 
leads to anti-inflammatory signals, while interaction of the APC with necrotic cells through 
a2M receptor leads to pro-inflammatory innate and adaptive immune responses (see 
Srivastava et al, 1998, Immunity 8: 657-665). 

35 
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The invention is not to be limited in scope by the specific embodiments described 
which are intended as single illustrations of individual aspects of the invention, and 
functionally equivalent methods and components are within the scope of the invention. 
5 Indeed various modifications of the invention, in addition to those shown and described 
herein will become apparent to those skilled in the art from the foregoing description and 
accompanying drawings. Such modifications are intended to fall within the scope of the 
appended claims. 

All references cited herein, including patent applications, patents, and other 
jq publications, are incorporated by reference herein in their entireties for all purposes. 
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WHAT IS CLAIMED IS : 

.1 . A pharmaceutical composition comprising an amount of a molecular complex 
effective for treatment or prevention of an infectious disease or cancer, and a 
pharmaceutically acceptable carrier, said molecular complex comprising an alpha (2) 
^ macroglobulin polypeptide noncovalently associated with an antigenic molecule which 
displays the antigenicity of an antigen of an infectious agent or an antigen overexpressed in a 
cancer cell relative to its expression in a noncancerous cell of said cell type. 

2. The pharmaceutical composition of Claim 1 wherein the antigenic molecule 
10 displays the antigenicity of an antigen of an infectious agent. 

3 . The pharmaceutical composition of Claim 1 wherein the antigenic molecule 
displays the antigenicity of an antigen overexpressed in a cancer cell relative to its expression 
in a noncancerous cell of said cell type. 

15 

4. The pharmaceutical composition of Claim 1 wherein the antigenic molecule is 
a tumor specific antigen or a tumor-associated antigen. 

5. A pharmaceutical composition comprising an amount of a fusion protein 
effective for treatment or prevention of an infectious disease or cancer, and a 
pharmaceutically acceptable carrier, said fusion protein comprised of an alpha (2) 
macroglobulin polypeptide and an antigenic molecule which displays the antigenicity of an 
antigen of an infectious agent or an antigen overexpressed in a cancer cell relative to its 
expression in a noncancerous cell of said cell type. 

25 

6. The pharmaceutical composition of Claim 1 wherein the molecular complex is 
purified, 

7. A purified molecular complex comprising an alpha (2) macroglobulin 

3° polypeptide noncovalently associated with an antigenic molecule of an infectious agent or an 
antigen overexpressed in a cancer cell relative to its expression in a noncancerous cell of said 
cell type. 



35 



8. A purified population of molecular complexes in which at least 65% of said 
complexes comprise an alpha (2) macroglobulin noncovalently associated with an antigenic 
molecule. 
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9. A purified population of molecular complexes purified from a recombinant 
cell in which at least 65% of said complexes comprise an alpha (2) macroglobulin 
noncovalently associated with an antigenic molecule. 

5 

10. A recombinant cell infected with a pathogen and transformed with a nucleic 
acid comprising a nucleotide sequence that (i) is operably linked to a promoter, and (ii) 
encodes an alpha (2) macroglobulin polypeptide, which alpha (2) macroglobulin polypeptide 
associates with an antigenic molecule, when said antigenic molecule is present, to foim a 

2Q complex that in sufficient amount is capable of eliciting an immune response to the antigenic 
molecule. 

11. A recombinant cancer cell transformed with a nucleic acid comprising a 
nucleotide sequence that (i) is operably linked to a promoter, and (ii) encodes an alpha (2) 

j 5 macroglobulin polypeptide, which alpha (2) macroglobulin polypeptide associates with an 
antigenic molecule, when said antigen is present, to form a complex that in sufficient amount 
is capable of eliciting an immune response to the antigenic molecule. 

12. A recombinant cell transformed with (i) a first nucleic acid comprising a first 
20 nucleotide sequence that is operably linked to a first promoter and encodes an alpha (2) 

macroglobulin polypeptide, and (ii) a second nucleic acid comprising a second nucleotide 
sequence that is operably linked to a second promoter and encodes an antigenic molecule, 
such that the alpha (2) macroglobulin polypeptide and the antigenic molecule are expressed 
within the cell and associate with each other to foim a complex that in sufficient amount is 
25 capable of eliciting an immune response to the antigenic molecule. 

1 3 . The recombinant cell of Claim 1 0, 1 1 , or 1 2 which is a human cell. 

14. A pharmaceutical composition comprising the recombinant cell of any one of 
20 Claims 10, 1 1, or 12 and a pharmaceutical^ acceptable carrier. 

15. A method for preparing a complex of an alpha (2) macroglobulin polypeptide 
noncovalently associated with an antigenic molecule, said alpha (2) macroglobulin 
polypeptide comprising: 

35 (a) culturing a cell transformed with a nucleic acid comprising a 

nucleotide sequence encoding the alpha (2) macroglobulin 
polypeptide, under conditions such that said alpha (2) macroglobulin 
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polypeptide is expressed by the cells and associates with an antigenic 
molecule of the cell; and 
(b) recovering a population of complexes of the alpha (2) macroglobulin 
5 polypeptide noncovalently associated with antigenic molecules from 

the cells. 

16. A method for preparing an alpha (2) macroglobulin polypeptide 
noncovalently associated with antigenic molecules derived from one or more antigens of an 

10 infectious agent, comprising: 

(a) culturing infected cells, transformed with a nucleic acid comprising a 
nucleotide sequence encoding the alpha (2) macroglobulin 
polypeptide, or fragment, analog, or variant thereof, and operably 
linked to a promoter, under conditions such that the alpha (2) 

Y 5 macroglobulin polypeptideis expressed by the cells and associates 

with peptides of the cells; and 

(b) recovering from the cells a population of complexes of the alpha (2) 
macroglobulin polypeptide noncovalently associated with peptides 
derived from the infectious agent. 

20 

17. The method of Claim 15 or 16, further comprising purifying the complexes. 

18. The method of Claim 15 or 16, further comprising purifying the complexes by 
affinity chromatography, 

25 

19. A method of treating or preventing an infectious disease in an individual 
having an infectious disease comprising administering to the individual an immunogenic 
amount of purified complexes comprising an alpha (2) macroglobulin polypeptide 
noncovalently associated with a first antigenic molecule, wherein the first antigenic molecule 

30 displays the antigenicity of an antigen of an infectious agent of the infectious disease. 

20. The method of Claim 1 9, further comprising, before, concurrently or after 
administration of the immunogenic complex* administering to the individual a composition 
comprising antigen presenting cells sensitized in vitro with a sensitizing amount of a second 

35 complex of alpha (2) macroglobulin polypeptide noncovalently associated with a second 
antigenic molecule, said second antigenic molecule displaying the antigenicity of a second 
antigenic molecule of said infectious agent. 
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21. A method of treating or preventing an infectious disease in a subject having 
an infectious disease comprising: 

(a) culturing an infected cell transformed with a nucleic acid comprising a 
nucleotide sequence encoding an alpha (2) macroglobulin polypeptide, 
said infected cell displaying the antigenicity of an antigen of an 
infectious agent of the infectious disease, said nucleotide sequence 
being operably linked to a promoter, under conditions such that the 
alpha (2) macroglobulin polypeptide is expressed by the infected cells 
and associates with antigenic molecules of the cell; 

(b) recovering complexes of the alpha (2) macroglobulin polypeptide 
noncovalently associated with antigenic molecules from the infected 
cell; and 

(c) administering to the subject an amount of the recovered complexes 
effective to treat or prevent the infectious disease. 

22. The method of Claim 21, further comprising, prior to step (a), the step of 
obtaining infected cells from the subject and transforming the infected cells with the nucleic 



23. The method of Claim 21, further comprising, prior to step (a), the step of 
obtaining the infected cell from one or more individuals and transforming the infected cells 
with the nucleic acid, said one or more individuals being different from the subject and 
having the same type of infectious disease as the subject. 



24. A method of treating or preventing an infectious disease in a subject having 
an infectious disease comprising: 



(a) culturing a recombinant cell transformed with (i) a first nucleic acid 
encoding an alpha (2) macroglobulin polypeptide and (ii) a second 
nucleic acid encoding an antigenic molecule displaying the 
antigenicity of an antigen of an infectious agent of the infectious 
disease; 

(b) recovering complexes of the alpha (2) macroglobulin polypeptide 
noncovalently associated with the antigen; and 

(c) administering to the subject an amount of the recovered complexes 
effective to treat or prevent the infectious disease. 



acid. 



20 



25 
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25 . The method of Claim 19, 2 1 , or 24, in which the infectious disease is caused 
by an infectious agent selected from the group consisting of a virus, a bacterium, a fungus, 
and a parasite. 

5 

26. A method of treating or preventing cancer in an individual having a type of 
cancer or in whom prevention of a type of cancer is desired comprising administering to the 
individual an immunogenic amount of purified complexes comprising an alpha (2) 
macroglobulin polypeptide noncovalently associated with a first antigenic molecule, wherein 

lO either (a) the first antigenic molecule displays antigenicity of said type of cancer or a 

metastasis thereof; or (b) the complex is obtained by recovering complexes from said type of 
cancer cells or a metastasis thereof that recombinantly express the alpha (2) macroglobulin 
polypeptide. 

15 27. The method of Claim 26, further comprising, before, concurrently or after 

administration of the immunogenic complex, administering to the individual a composition 
comprising antigen presenting cells sensitized in vitro with a sensitizing amount of a second 
complex of an alpha (2) macroglobulin noncovalently associated with a second antigenic 
molecule, said second antigenic molecule displaying the antigenicity of an antigen 

20 overexpressed in a cancer cell relative to its expression in a noncancerous cell of said cell 
type. 

28. The method of Claim 26, wherein the first antigenic molecule is an antigen 
overexpressed in a cancer cell relative to its expression in a noncancerous cell of said cell 

25 tyP e - 

29. The pharmaceutical composition of Claim 26, wherein the antigenic molecule 
is a tumor-specific antigen or a tumor-associated antigen. 

3Q 30. A method of treating or preventing cancer in a subj ect having a type of cancer 

or in whom prevention of a type of cancer is desired comprising: 

(a) culturing a cancer cell of said type transformed with a nucleic acid 
comprising a nucleotide sequence encoding an alpha (2) 
macroglobulin polypeptide said nucleotide sequence being operably 
35 linked to a promoter, under conditions such that the alpha (2) 

macroglobulin polypeptide is expressed by the cancer cell and 
associates with at least one antigenic molecule of the cell; 
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(b) recovering complexes of the alpha (2) macroglobulin polypeptide 
noncovalently associated with at least one antigenic molecule from the 
cancer cell; and 

(c) administering to the subject an amount of the recovered complexes 
effective to treat or prevent cancer. 



31. The method of Claim 30, further comprising, prior to step (a), the step of 
obtaining cancer cells from the subject and transforming the cancer cells with the nucleic 

10 acid, 

32. The method of Claim 30, further comprising, prior to step (a), the step of 
obtaining cancer cells from one or more individuals and transforming the cancer cells with 
the nucleic acid, said one or more individuals being different from the subject and having the 

!5 same type of cancer as the subject. 

33. A method of treating or preventing cancer in a subject having a type of cancer 
or in whom prevention of a type of cancer is desired comprising: 



(a) culturing a recombinant cell transformed with (i) a first nucleic acid 
encoding an alpha (2) macroglobulin polypeptide, and (ii) a second 
nucleic acid encoding an antigenic molecule displaying the 
antigenicity of an antigen of a cancer cell; 

(b) recovering complexes of the alpha (2) macroglobulin polypeptide 
noncovalently associated with the antigen; and 

(c) administering to the subject an amount of the recovered complexes 
effective to treat or prevent cancer. 



34. The method of Claim 26, 30, or 33, in which the type of cancer is selected 
from the group consisting of fibrosarcoma, myxosarcoma, liposarcoma, chondrosarcoma, 

30 osteogenic sarcoma, chordoma, angiosarcoma, endotheliosarcoma, lymphangiosarcoma, 
lymphangioendotheliosarcoma, synovioma* mesothelioma, Ewing f s tumor, leiomyosarcoma, 
rhabdomyosarcoma, colon carcinoma, pancreatic cancer, breast cancer, ovarian cancer, 
prostate cancer, squamous cell carcinoma, basal cell carcinoma, adenocarcinoma, sweat 
gland carcinoma, sebaceous gland carcinoma, papillary carcinoma, papillary 

35 adenocarcinomas, cystadenocarcinoma, medullary carcinoma, bronchogenic carcinoma, renal 
cell carcinoma, hepatoma, bile duct carcinoma, choriocarcinoma, seminoma, embryonal 
carcinoma, Wilms 1 tumor, cervical cancer, testicular tumor, lung carcinoma, small cell lung 
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carcinoma, bladder carcinoma, epithelial carcinoma, glioma, astrocytoma, medulloblastoma, 
craniopharyngioma, ependymoma, pinealdma, hemangioblastoma, acoustic neuroma, 
oligodendroglioma, meningioma, melanoma, neuroblastoma, retinoblastoma, leukemia, 
5 lymphoma, multiple myeloma, Waldenstrom's macroglobulinemia, and heavy chain disease. 

35. A method for treating an autoimmune disorder comprising administering to a 
mammal in need of such treatment an antibody specific for alpha (2) macroglobulin. 

10 36. The method of Claim 35, wherein the antibody is purified. 
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POSITION MH+ SEQUENCE 

509-518 955.0122 SGFSLGSDGK (SEQ ID NO: 54) 

328-337 973.1753 GIALDPAMGK (SEQ ID NO: 55) 

460-469 1152.3010 GGALHIYHQR (SEQ ID NO: 56) 

338-348 1315.5116 VFFTDYGQIPK (SEQ ID NO: 57) 

FIG.3C 
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CGCTGCTCCC CGCCAGTGCA CTGAGGAGGC GGAAACGGGG GAGCCCCTAG TGCTCCATCA 60 

GGCCCCTACC AAGGCACCCC CATCGGGTCC ACGCCGCCCA CCCCCCACCC CGCCTCCTCC 120 

CAATTGTGCA TTTTTGCAGC CGGAGTCGGC TCCGAGATGG GGCTGTGAGC TTCGCCCTGG 180 

GAGGGGGAGA GGAGCGAGGA GTAAAGCAGG GGTGMGGGT TCGAATTTGG GGGCAGGGGG 240 

CGCACCCGCG TCAGCAGGCC CTTCCCAGGG GGCTCGGAAC TGTACCATTT CACCTATGCC 300 

CCTGGTTCGC TTTGCTTAAG GAAGGATAAG ATAGAAGAGT CGGGGAGAGG AAGATAAAGG 360 

GGGACCCCCC AATTGGGGGG GGCGAGGACA AGAAGTAACA GGACCAGAGG GTGGGGGCTG 420 

CTGTTTGCAT CGGCCCACAC C ATG CTG ACC CCG CCG TTG CTG CTG CTC GTG 471 

Met Leu Thr Pro Pro Leu Leu Leu Leu Val 
1 5 10 

CCG CTG CTT TCA GCT CTG GTC TCC GGG GCC ACT ATG GAT GCC CCT AAA 519 
Pro Leu Leu Ser Ala Leu Val Ser Gly Ala Thr Met Asp Ala Pro Lys 
15 20 25 

ACT TGC AGC CCT AAG CAG TTT GCC TGC AGA GAC CAA ATC ACC TGT ATC, 567 
Thr Cys Ser Pro Lys Gin Phe Ala Cys Arg Asp Gin He Thr Cys He 
30 35 40 

TCA AAG GGC TGG CGG TGT GAC GGT GAA AGA GAT TGC CCC GAC GGC TCT 615 
Ser Lys Gly Trp Arg Cys Asp Gly Glu Arg Asp Cys Pro Asp Gly Ser 
45 50 55 . 

GAT GAA GCC CCT GAG ATC TGT CCA CAG AGT AAA GCC CAG AGA TGC CCG 663 
Asp Glu Ala Pro Glu He Cys Pro Gin Ser Lys Ala Gin Arg Cys Pro 
60 65 70 

CCA AAT GAG CAC AGT TGT CTG GGG ACT GAG CTA TGT GTC CCC ATG TCT 711 
Pro Asn Glu His Ser Cys Leu Gly Thr Glu Leu Cys Val Pro Met Ser 
75 80 85 90 



CGT CTC TGC AAC GGG ATC CAG GAC TGC ATG GAT GGC TCA GAC GAG GGT 759 
Arg Leu Cys Asn Gly He Gin Asp Cys Met Asp Gly Ser Asp Glu Gly 
95 100 105 

GCT CAC TGC CGA GAG CTC CGA GCC AAC TGT TCT CGA ATG GGT TGT CAA 807 
Ala His Cys Arg Glu Leu Arg Ala Asn Cys Ser Arg Met Gly Cys Gin 
110 115 120 



CAC CAT TGT GTA CCT ACA CCC AGT GGG CCC ACG TGC TAC TGT AAC AGC 
His His Cys Val Pro Thr Pro Ser Gly Pro Thr Cys Tyr Cys Asn Ser 
125 130 135 



855 
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AGC TTC CAG CTC GAG GCA GAT GGC AAG ACG TGC AAA GAT TTT GAC GAG 903 
Ser Phe Gin Leu Glu Ala Asp Gly Lys Thr Cys Lys Asp Phe Asp Glu 
140 145 150 

TGT TCC GTG TAT GGC ACC TGC AGC CAG CTT TGC ACC AAC ACA GAT GGC 951 
Cys Ser Val Tyr Gly Thr Cys Ser Gin Leu Cys Thr Asn Thr Asp Gly 
155 160 165 170 

TCC TTC ACA TGT GGC TGT GTT GAA GGC TAC CTG CTG CAA CCG GAC AAC 999 
Ser Phe Thr Cys Gly Cys Val Glu Gly Tyr Leu Leu Gin Pro Asp Asn 
175 180 185 

CGC TCC TGC AAG GCC AAG AAT GAG CCA GTA GAT CGG CCG CCA GTG CTA 1047 
Arg Ser Cys Lys Ala Lys Asn Glu Pro Val Asp Arg Pro Pro Val Leu 
190 195 200 

CTG ATT GCC AAC TCT CAG AAC ATC CTA GCT ACG TAC CTG AGT GGG GCC 1095 
Leu He Ala Asn Ser Gin Asn He Leu Ala Thr Tyr Leu Ser Gly Ala 
205 210 215 

CAA GTG TCT ACC ATC ACA CCC ACC AGC ACC CGA CAA ACC ACG GCC ATG 1143 
Gin Val Ser Thr He Thr Pro Thr Ser Thr Arg Gin Thr Thr Ala Met 
220 225 230 

GAC TTC AGT TAT GCC AAT GAG ACC GTA TGC TGG GTG CAC GTT GGG GAC 1191 
Asp Phe Ser Tyr Ala Asn Glu Thr Val Cys Trp Val His Val Gly Asp 
235 240 245 250 

AGT GCT GCC CAG ACA CAG CTC AAG TGT GCC CGG ATG CCT GGC CTG AAG 1239 
Ser Ala Ala Gin Thr Gin Leu Lys Cys Ala Arg Met Pro Gly Leu Lys 
255 260 265 

GGC TTT GTG GAT GAG CAT ACC ATC AAC ATC TCC CTC AGC CTG CAC CAC 1287 
Gly Phe Val Asp Glu His Thr He Asn He Ser Leu Ser Leu His His 
270 275 280 

GTG GAG CAG ATG GCA ATC GAC TGG CTG ACG GGA AAC TTC TAC TTT GTC 1335 
Val Glu Gin Met Ala He Asp Trp Leu Thr Gly Asn Phe Tyr Phe Val 
285 290 295 

GAC GAC ATT GAC GAC AGG ATC TTT GTC TGT AAC CGA AAC GGG GAC ACC 1383 
Asp Asp He Asp Asp Arg He Phe Val Cys Asn Arg Asn Gly Asp Thr 
300 305 310 
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TGT GTC ACT CTG CTG GAC CTG GAA CTC TAC AAC CCC AAA GGC ATC GCC 1431 
Cys Val Thr Leu Leu Asp Leu Glu Leu Tyr Asn Pro Lys Gly He Ala 
315 320 325 330 

TTG GAC CCC GCC ATG GGG AAG GTG TTC TTC ACT GAC TAC GGG CAG ATC 1479 
Leu Asp Pro Ala Met Gly Lys Val Phe Phe Thr Asp Tyr Gly Gin He 
335 340 345 

CCA AAG GTG GAG CGC TGT GAC ATG GAT GGA CAG AAC CGC ACC AAG CTG 1527 
Pro Lys Val Glu Arg Cys Asp Met Asp Gly Gin Asn Arg Thr Lys Leu 
350 355 360 



GTG GAT AGC AAG ATC GTG TTT CCA CAC GGC ATC ACC CTG GAC CTG GTC 
Val Asp Ser Lys He Val Phe Pro His Gly He Thr Leu Asp Leu Val 
365 370 375 



1575 



AGC CGC CTC GTC TAC TGG GCG GAC GCC TAC CTA GAC TAC ATC GAG GTG 
Ser Arg Leu Val Tyr Trp Ala Asp Ala Tyr Leu Asp Tyr He Glu Val 
380 385 390 



1623 



GTA GAC TAC GAA GGG AAG GGT CGG CAG ACC ATC ATC CAA GGC ATC CTG 
Val Asp Tyr Glu Gly Lys Gly Arg Gin Thr lie He Gin Gly lie Leu 
395 400 405 410 



1671 



ATC GAG CAC CTG TAC GGC CTG ACC GTG TTT GAG AAC TAT CTC TAC GCC 
He Glu His Leu Tyr Gly Leu Thr Val Phe Glu Asn Tyr Leu Tyr Ala 
. 415 420 425 



1719 



ACC AAC TCG GAC AAT GCC AAC ACG CAG CAG AAG ACG AGC GTG ATC CGA 
Thr Asn Ser Asp Asn Ala Asn Thr Gin Gin Lys Thr Ser Val He Arg 
430 435 440 



1767 



GTG AAC CGG TTC AAC AGT ACT GAG TAC CAG GTC GTC ACC CGT GTG GAC 
Val Asn Arg Phe Asn Ser Thr Glu Tyr Gin Val Val Thr Arg Val Asp 
445 450 455 



1815 



AAG GGT GGT GCC CTG CAT ATC TAC CAC CAG CGA CGC CAG CCC CGA GTG 1863 
Lys Gly Gly Ala Leu His lie Tyr His Gin Arg Arg Gin Pro Arg Val 
460 465 470 

CGG AGT CAC GCC TGT GAG AAT GAC CAG TAC GGG AAG CCA GGT GGC TGC 1911 
Arg Ser His Ala Cys Glu Asn Asp Gin Tyr Gly Lys Pro Gly Gly Cys 
475 480 485 490 
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TCC GAC ATC TGC CTC CTG GCC AAC AGT CAC AAG GCA AGG AGC TGC AGG 1959 
Ser Asp He Cys Leu Leu Ala Asn Ser His Lys Ala Arg Thr Cys Arg 
495 500 505 

TGC AGG TCT GGC TTC AGC CTG GGA AGT GAT GGG AAG TCT TGT AAG AAA 2007 
Cys Arg Ser Gly Phe Ser Leu Gly Ser Asp Gly Lys Ser Cys Lys Lys 
510 515 520 

CCT GAA CAT GAG CTG TTC CTC GTG TAT GGC AAG GGC CGA CCA GGC ATC 2055 
Pro Glu His Glu Leu Phe Leu Val Tyr Gly Lys Gly Arg Pro Gly He 
525 530 535 

ATT AGA GGC ATG GAC ATG GGG GCC AAG GTC CCA GAT GAG CAC ATG ATC 2103 
He Arg Gly Met Asp Met Gly Ala Lys Val Pro Asp Glu His Met He 
540 545 550 

CCC ATC GAG AAC CTT ATG AAT CCA CGC GCT CTG GAC TTC CAC GCC GAG 2151 
Pro He Glu Asn Leu Met Asn Pro Arg Ala Leu Asp Phe His Ala Glu 
555 560 565 570 

ACC GGC TTC ATC TAC TTT GCT GAC ACC ACC AGC TAC CTC ATT GGC CGC 2199 
Thr Gly Phe He Tyr Phe Ala Asp Thr Thr Ser Tyr Leu He Gly Arg 
575 580 585 

CAG AAA ATT GAT GGC ACG GAG AGA GAG ACT ATC CTG AAG GAT GGC ATC 2247 
Gin Lys He Asp Gly Thr Glu Arg Glu Thr He Leu Lys Asp Gly He 
590 595 600 

CAC AAT GTG GAG GGC GTA GCC GTG GAC TGG ATG GGA GAC AAT CTT TAC 2295 
His Asn Val Glu Gly Val Ala Val Asp Trp Met Gly Asp Asn Leu Tyr 
605 610 615 

TGG ACT GAT GAT GGC CCC AAG AAG ACC ATT AGT GTG GCC AGG CTG GAG 2343 
Trp Thr Asp Asp Gly Pro Lys Lys Thr He Ser Val Ala Arg Leu Glu 
620 625 630 

AAA GCC GCT CAG ACC CGG AAG ACT CTA ATT GAG GGC AAG ATG ACA CAC 2391 
Lys Ala Ala Gin Thr Arg Lys Thr Leu He Glu Gly Lys Met Thr His 
635 640 645 650 

CCC AGG GCC ATT GTA GTG GAT CCA CTC AAT GGG TGG ATG TAC TGG ACA 2439 
Pro Arg Ala He Val Val Asp Pro Leu Asn Gly Trp Met Tyr Trp Thr 
655 660 665 
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GAC T6G GAG GAG GAC CCC AAG GAC AGT CGG CGA GGG CGG CTC GAG AGG 2487 
Asp Trp Glii Glu Asp Pro Lys Asp Ser Arg Arg Gly Arg Leu Glu Arg 
670 675 680 

GCT TGG ATG GAC GGC TCA CAC CGA GAT ATC TTT GTC ACC TCC AAG ACA 2535 
Ala Trp Met Asp Gly Ser His Arg Asp He Phe Val Thr Ser Lys Thr 
685 690 695 

GTG CTT TGG CCC AAT GGG CTA AGC CTG GAT ATC CCA GCC GGA CGC CTC 2583 
Val Leu Trp Pro Asn Gly Leu Ser Leu Asp He Pro Ala Gly Arg Leu 
700 705 710 

TAG TGG GTG GAT GCC TTC TAT GAC CGA ATT GAG ACC ATA CTG CTC AAT 2631 
Tyr Trp Val Asp Ala Phe Tyr Asp Arg He Glu Thr He Leu Leu Asn 
715 ' 720 725 730 

GGC ACA GAC CGG AAG ATT GTA TAT GAG GGT CCT GAA CTG AAT CAT GCC 2679 
Gly Thr Asp Arg Lys He Val Tyr Glu Gly Pro Glu Leu Asn His Ala 
735 740 745 

TTC GGC CTG TGT CAC CAT GGC AAC TAC CTC TTT TGG ACC GAG TAC CGG 2727 
Phe Gly Leu Cys His His Gly Asn Tyr Leu Phe Trp Thr Glu Tyr Arg 
750 755 760 

AGC GGC AGC GTC TAC CGC TTG GAA CGG GGC GTG GCA GGC GCA CCG CCC 2775 
Ser Gly Ser Val Tyr Arg Leu Glu Arg Gly Val Ala Gly Ala Pro Pro 
765 770 775 

ACT GTG ACC CTT CTG CGC AGC GAG AGA CCG CCT ATC TTT GAG ATC CGA 2823 
Thr Val Thr Leu Leu Arg Ser Glu Arg Pro Pro He Phe Glu He Arg 
780 785 790 

ATG TAC GAC GCG CAC GAG CAG CAA GTG GGT ACC AAC AAA TGC CGG GTA 2871 
Met Tyr Asp Ala His Glu Gin Gin Val Gly Thr Asn Lys Cys Arg Val 
795 800 805 810 

AAT AAC GGA GGC TGC AGC AGC CTG TGC CTC GCC ACC CCC GGG AGC CGC 2919 
Asn Asn Gly Gly Cys Ser Ser Leu Cys Leu Ala Thr Pro Gly Ser Arg 
815 820 825 

CAG TGT GCC TGT GCC GAG GAC CAG GTG TTG GAC ACA GAT GGT GTC ACC 2967 
Gin Cys Ala Cys Ala Glu Asp Gin Val Leu Asp Thr Asp Gly Val Thr 
830 835 840 
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TGC TTG GCG MC CCA TCC TAC GTG CCC CCA CCC CAG TGC CAG CCG GGC 3015 
Cys Leu Ala Asn Pro Ser Tyr Val Pro Pro Pro Gin Cys Gin Pro Gly 
845 850 855 

CAG TTT GCC TGT GCC AAC AAC CGC TGC ATC CAG GAG CGC TGG AAG TGT 3063 
Gin Phe Ala Cys Ala Asn Asn Arg Cys He Gin Glu Arg Trp Lys Cys 
860 865 870 

GAC GGA GAC AAC GAC TGT CTG GAC AAC AGC GAT GAG GCC CCA GCA CTG 3111 
Asp Gly Asp Asn Asp Cys Leu Asp Asn Ser Asp Glu Ala Pro Ala Leu 
875 880 885 890 

TGC CAT CAA CAC ACC TGT CCC TCG GAC CGA TTC AAG TGT GAG AAC AAC 3159 
Cys His Gin His Thr Cys Pro Ser Asp Arg Phe Lys Cys Glu Asn Asn 
895 900 905 

CGG TGT ATC CCC AAC CGC TGG CTC TGT GAT GGG GAT AAT GAT TGT GGC 3207 
Arg Cys lie Pro Asn Arg Trp Leu Cys Asp Gly Asp Asn Asp Cys Gly 
910 915 920 

AAC AGC GAG GAC GAA TCC AAT GCC ACG TGC TCA GCC CGC ACC TGT CCA 3255 
Asn Ser Glii Asp Glu Ser Asn Ala Thr Cys Ser Ala Arg Thr Cys Pro 
925 930 935 

CCC AAC CAG TTC TCC TGT GCC AGT GGC CGA TGC ATT CCT ATC TCA TGG 3303 
Pro Asn Gin Phe Ser Cys Ala Ser Gly Arg Cys He Pro He Ser Trp 
940 945 950 

ACC TGT GAT CTG GAT GAT GAC TGT GGG GAC CGG TCC GAT GAG TCA GCC 3351 
Thr Cys Asp Leu Asp Asp Asp Cys Gly Asp Arg Ser Asp Glu Ser Ala 
955 960 965 970 

TCA TGC GCC TAC CCC ACC TGC TTC CCC CTG ACT CAA TTT ACC TGC AAC 3399 
Ser Cys Ala Tyr Pro Thr Cys Phe Pro Leu Thr Gin Phe Thr Cys Asn 
975 980 985 

AAT GGC AGA TGT ATT AAC ATC AAC TGG CGG TGT GAC AAC GAC AAT GAC 3447 
Asn Gly Arg Cys He Asn He Asn Trp Arg Cys Asp Asn Asp Asn Asp 
990 995 1000 

TGT GGG GAC AAC AGC GAC GAA GCC GGC TGC AGT CAC TCC TGC TCC AGT 3495 
Cys Gly Asp Asn Ser Asp Glu Ala Gly Cys Ser His Ser Cys Ser Ser 
1005 1010 1015 
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ACC CA6 TTC AAG TGC AAC AGT GGC AGA TGC ATC CCC GAG CAC TGG ACG 
Thr Gin Phe Lys Cys Asn Ser Gly Arg Cys He Pro Glu His Trp Thr 
1020 1025 1030 



3543 



TGT GAT GGG GAC AAT GAT TGT GGG GAC TAC AGC GAC GAG ACA CAC GCC 
Cys Asp Gly Asp Asn Asp Cys Gly Asp Tyr Ser Asp Glu Thr His Ala 
1035 1040 1045 1050 



3591 



AAC TGT ACC AAC CAG GCT ACA AGA CCT CCT GGT GGC TGC CAC TCG GAT 3639 
Asn Cys Thr Asn Gin Ala Thr Arg Pro Pro Gly Gly Cys His Ser Asp 
1055 1060 1065 

GAG TTC CAG TGC CCG CTA GAT GGC CTG TGC ATC CCC CTG AGG TGG CGC 3687 
Glu Phe Gin Cys Pro Leu Asp Gly Leu Cys He Pro Leu Arg Trp Arg 
1070 1075 1080 



TGC GAC GGG GAC ACC GAC TGC ATG GAT TCC AGC GAT GAG AAG AGC TGT 
Cys Asp Gly Asp Thr Asp Cys Met Asp Ser Ser Asp Glu Lys Ser Cys 
1085 1090 1095 



3735 



GAG GGC GTG ACC CAT GTT TGT GAC CCG AAT GTC AAG TTT GGC TGC AAG 
Glu Gly Val Thr His Val Cys Asp Pro Asn Val Lys Phe Gly Cys Lys 
1100 1105 1110 



3783 



GAC TCC GCC CGG TGC ATC AGC AAG GCG TGG GTG TGT GAT GGC GAC AGC 
Asp Ser Ala Arg Cys He Ser Lys Ala Trp Val Cys Asp Gly Asp Ser 
1115 1120 1125 1130 



3831 



GAC TGT GAA GAT AAC TCC GAC GAG GAG AAC TGT GAG GCC CTG GCC TGC 3879 
Asp Cys Glu Asp Asn Ser Asp Glu Glu Asn Cys Glu Ala Leu Ala Cys 
1135 1140 1145 



AGG CCA CCC TCC CAT CCC TGC GCC AAC AAC ACC TCT GTC TGC CTG CCT 
Arg Pro Pro Ser His Pro Cys Ala Asn Asn Thr Ser Val Cys Leu Pro 
1150 1155 1160 



3927 



CCT GAC AAG CTG TGC GAC GGC AAG GAT GAC TGT GGA GAC GGC TCG GAT 
Pro Asp Lys Leu Cys Asp Gly Lys Asp Asp Cys Gly Asp Gly Ser Asp 
1165 1170 1175 



3975 



GAG GGC GAG CTC TGT GAC CAG TGT TCT CTG AAT AAT GGT GGC TGT AGT 
Glu Gly Glu Leu Cys Asp Gin Cys Ser Leu Asn Asn Gly Gly Cys Ser 
1180 1185 1190 



4023 
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CAC AAC TGC TCA GTG GCC CCT GGT GAA GGC ATC GTG TGC TCT TGC CCT 4071 
His Asn Cys Ser Val Ala Pro Gly Glu Gly lie Val Cys Ser Cys Pro 
1195 1200 1205 1210 

CTG GGC ATG GAG CTG GGC TCT GAC AAC CAC ACC TGC CAG ATC CAG AGC 4119 
Leu Gly Met Glu Leu Gly Ser Asp Asn His Thr Cys Gin He Gin Ser 
1215 1220 1225 

TAC TGT GCC AAG CAC CTC AAA TGC AGC CAG AAG TGT GAC CAG AAC AAG 4167 
Tyr Cys Ala Lys His Leu Lys Cys Ser Gin Lys Cys Asp Gin Asn Lys 
1230 1235 1240 

TTC AGT GTG AAG TGC TCC TGC TAC GAG GGC TGG GTC TTG GAG CCT GAC 4215 
Phe Ser Val Lys Cys Ser Cys Tyr Glu Gly Trp Val Leu Glu Pro Asp 
1245 1250 1255 

GGG GAA ACG TGC CGC AGT CTG GAT CCC TTC AAA CTG TTC ATC ATC TTC 4263 
Gly Glu Thr Cys Arg Ser Leu Asp Pro Phe Lys Leu Phe He lie Phe 
1260 1265 1270 

TCC AAC CGC CAC GAG ATC AGG CGC ATT GAC CTT CAC AAG GGG GAC TAC 4311 
Ser Asn Arg His Glu He Arg Arg He Asp Leu His Lys Gly Asp Tyr 
1275 1280 1285 1290 

AGC GTC CTA GTG CCT GGC CTG CGC AAC ACT ATT GCC CTG GAC TTC CAC 4359 
Ser Val Leu Val Pro Gly Leu Arg Asn Thr He Ala Leu Asp Phe His 
1295 1300 1305 

CTC AGC CAG AGT GCC CTC TAC TGG ACC GAC GCG GTA GAG GAC AAG ATC 4407 
Leu Ser Gin Ser Ala Leu Tyr Trp Thr Asp Ala Val Glu Asp Lys He 
1310 1315 1320 

TAC CGT GGG AAA CTC CTG GAC AAC GGA GCC CTG ACC AGC TTT GAG GTG 44§5 
Tyr Arg Gly Lys Leu Leu Asp Asn Gly Ala Leu Thr Ser Phe Glu Val 
1325 1330 1335 

GTG ATT CAG TAT GGC TTG GCC ACA CCA GAG GGC CTG GCT GTA GAT TGG 4503 
Val He Gin Tyr Gly Leu Ala Thr Pro Glu Gly Leu Ala Val Asp Trp 
1340 1345 1350 

ATT GCA GGC AAC ATC TAC TGG GTG GAG AGC AAC CTG GAC CAG ATC GAA 4551 
He Ala Gly Asn He Tyr Trp Val Glu Ser Asn Leu Asp Gin He Glu 
1355 1360 1365 1370 
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GTG GCC AAG CTG GAC GGA ACC CTC CGA ACC ACT CTG CTG GCG GGT GAC 4599 
Val Ala Lys Leu Asp Gly Thr Leu Arg Thr Thr Leu Leu Ala Gly Asp 
1375 1380 1385 

ATT GAG CAC CCG AGG GCC ATC GCT CTG GAC CCT CGG GAT GGG An CTG 4647 
He Glu His Pro Arg Ala He Ala Leu Asp Pro Arg Asp Gly He Leu 
1390 1395 1400 

TTT TGG ACA GAC TGG GAT GCC AGC CTG CCA CGA ATC GAG GCT GCA TCC 4695 
Phe Trp Thr Asp Trp Asp Ala Ser Leu Pro Arg He Glu Ala Ala Ser 
1405 1410 1415 

ATG AGT GGA GCT GGC CGC CGA ACC ATC CAC CGG GAG ACA GGC TCT GGG 4743 
Met Ser Gly Ala Gly Arg Arg Thr He His Arg Glu Thr Gly Ser Gly 
1420 1425 1430 

GGC TGC GCC AAT GGG CTC ACC GTG GAT TAC CTG GAG AAG CGC ATC CTC 4791 
Gly Cys Ala Asn Gly Leu Thr Val Asp Tyr Leu Glu Lys Arg He Leu 
1435 1440 1445 1450 

TGG ATT GAT GCT AGG TCA GAT GCC ATC TAT TCA GCC CGG TAT GAC GGC 4839 
Trp He Asp Ala Arg Ser Asp Ala He Tyr Ser Ala Arg Tyr Asp Gly 
1455 1460 1465 

TCC GGC CAC ATG GAG GTG CTT CGG GGA CAC GAG TTC CTG TCA CAC CCA 4887 
Ser Gly His Met Glu Val Leu Arg Gly His Glu Phe Leu Ser His Pro 
1470 1475 1480 

TTT GCC GTG ACA CTG TAC GGT GGG GAG GTG TAC TGG ACC GAC TGG CGA 4935 
Phe Ala Val Thr Leu Tyr Gly Gly Glu Val Tyr Trp Thr Asp Trp Arg 
1485 1490 1495 

ACA AAT ACA CTG GCT AAG GCC AAC AAG TGG ACT GGC CAC AAC GTC ACC 4983 
Thr Asn Thr Leu Ala Lys Ala Asn Lys Trp Thr Gly His Asn Val Thr 
1500 1505 1510 

GTG GTA CAG AGG ACC AAC ACC CAG CCC TTC GAC CTG CAG GTG TAT CAC 5031 
Val Val Gin Arg Thr Asn Thr Gin Pro Phe Asp Leu Gin Val Tyr His 
1515 1520 1525 1530 

CCT TCC CGG CAG CCC ATG GCT CCA AAC CCA TGT GAG GCC AAT GGC GGC 5079 
Pro Ser Arg Gin Pro Met Ala Pro Asn Pro Cys Glu Ala Asn Gly Gly 
1535 1540 1545 
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C66 GGC CCC T6T TCC CAT CTG TGC CTC ATC AAC TAC AAC CGG ACC GTC 5127 
Arg Gly Pro Cys Ser His Leu Cys Leu He Asn Tyr Asn Arg Thr Val 
1550 1555 1560 

TCC TGG GCC TGT CCC CAC CTC ATG AAG CTG CAC AAG GAC AAC ACC ACC 5175 
Ser Trp Ala Cys Pro His Leu Met Lys Leu His Lys Asp Asn Thr Thr 
1565 1570 1575 

TGC TAT GAG TTT AAG AAG TTC CTG CTG TAC GCA CGT CAG ATG GAG ATC 5223 
Cys Tyr Glu Phe Lys Lys Phe Leu Leu Tyr Ala Arg Gin Met Glu He 
1580 1585 1590 

CGG GGC GTG GAC CTG GAT GCC CCG TAC TAC AAT TAT ATC ATC TCC TTC 5271 
Arg Gly Val Asp Leu Asp Ala Pro Tyr Tyr Asn Tyr He He Ser Phe 
1595 1600 1605 1610 

ACG GTG CCT GAT ATC GAC AAT GTC ACG GTG CTG GAC TAT GAT GCC CGA 5319 
Thr Val Pro Asp He Asp Asn Val Thr Val Leu Asp Tyr Asp Ala Arg 
1615 1620 1625 

GAG CAG CGA GTT TAC TGG TCT GAT GTG CGG ACT CAA GCC ATC AAA AGG 5367 
Glu Gin Arg Val Tyr Trp Ser Asp Val Arg Thr Gin Ala He Lys Arg 
1630 1635 1640 

GCA TTT ATC AAC GGC ACT GGC GTG GAG ACC GTT GTC TCT GCA GAC TTG 5415 
Ala Phe He Asn Gly Thr Gly Val Glu Thr Val Val Ser Ala Asp Leu 
1645 1650 1655 

CCC AAC GCC CAC GGG CTG GCT GTG GAC TGG GTC TCC CGA AAT CTG TTT 5463 
Pro Asn Ala His Gly Leu Ala Val Asp Trp Val Ser Arg Asn Leu Phe 
1660 1665 1670 

TGG ACA AGT TAC GAC ACC AAC AAG AAG CAG ATT AAC GTG GCC CGG CTG 5511 
Trp Thr Ser Tyr Asp Thr Asn Lys Lys Gin He Asn Val Ala Arg Leu 
1675 1680 1685 1690 

GAC GGC TCC TTC AAG AAT GCG GTG GTG CAG GGC CTG GAG CAG CCC CAC 5559 
Asp Gly Ser Phe Lys Asn Ala Val Val Gin Gly Leu Glu Gin Pro His 
1695 1700 1705 

GGC CTG GTC GTC CAC CCG CTT CGT GGC AAG CTC TAC TGG ACT GAT GGG 5607 
Gly Leu Val Val His Pro Leu Arg Gly Lys Leu Tyr Trp Thr Asp Gly 
1710 1715 1720 
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GAC AAC ATC AGC ATG GCC MC ATG GAT GGG AGC AAC CAC ACT CTG CTC 5655 
Asp Asn lie Ser Met Ala Ash Met Asp Gly Ser Asn His Thr Leu Leu 
1725 1730 1735 

TTC AGT GGC CAG AAG GGC CCT GTG GGG TTG GCC ATT GAC TTC CCT GAG 5703 
Phe Ser Gly Gin Lys Gly Pro Val Gly Leu Ala He Asp Phe Pro Glu 
1740 1745 1750 

AGC AAA CTC TAC TGG ATC AGC TCT GGG AAC CAC ACA ATC AAC CGT TGC 5751 
Ser Lys Leu Tyr Trp He Ser Ser Gly Asn His Thr He Asn Arg Cys 
1755 1760 1765 1770 

AAT CTG GAT GGG AGC GAG CTG GAG GTC ATC GAC ACC ATG CGG AGC CAG 5799 
Asn Leu Asp Gly Ser Glu Leu Glu Val lie Asp Thr Met Arg Ser Gin 
1775 1780 1785 

CTG GGC AAG GCC ACT GCC CTG GCC ATC ATG GGG GAC AAG CTG TGG TGG 5847 
Leu Gly Lys Ala Thr Ala Leu Ala lie Met Gly Asp Lys Leu Trp Trp 
1790 1795 1800 

GCA GAT CAG GTG TCA GAG AAG ATG GGC ACG TGC AAC AAA GCC GAT GGC 5895 
Ala Asp Gin Val Ser Glu Lys Met Gly Thr Cys Asn Lys Ala Asp Gly 
1805 1810 1815 

TCT GGG TCC GTG GTG CTG CGG AAC AGT ACC ACG TTG GTT ATG CAC ATG 5943 
Ser Gly Ser Val Val Leu Arg Asn Ser Thr Thr Leu Val Met His Met 
1820 1825 1830 

AAG GTG TAT GAC GAG AGC ATC CAG CTA GAG CAT GAG GGC ACC AAC CCC 5991 
Lys Val Tyr Asp Glu Ser He Gin Leu Glu His Glu Gly Thr Asn Pro 
1835 1840 1845 1850 

TGC AGT GTC AAC AAC GGA GAC TGT TCC CAG CTC TGC CTG CCA ACA TCA 6039 
Cys Ser Val Asn Asn Gly Asp Cys Ser Gin Leu Cys Leu Pro Thr Ser 
1855 1860 1865 

GAG ACG ACT CGC TCC TGT ATG TGT ACA GCC GGT TAC AGC CTC CGG AGC 6087 
Glu Thr Thr Arg Ser Cys Met Cys Thr Ala Gly Tyr Ser Leu Arg Ser 
1870 1875 1880 

GGA CAG CAG GCC TGT GAG GGT GTG GGC TCT TTT CTC CTG TAC TCT GTA 6135 
Gly Gin Gin Ala Cys Glu Gly Val Gly Ser Phe Leu Leu Tyr Ser Val 
1885 1890 1895 
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CAT GAG GGA ATT CGG GGG ATT CCA CTA GAT CCC AAT GAC AAG TCG GAT 6183 
His Glu Gly He Arg Gly He Pro Leu Asp Pro Asn Asp Lys Ser Asp 
1900 1905 1910 

GCC CTG GTC CCA GTG TCC GGA ACT TCA CTG GCT GTC GGA ATC GAC TTC 6231 
Ala Leu Val Pro Val Ser Gly Thr Ser Leu Ala Val Gly He Asp Phe 
1915 1920 1925 1930 

CAT GCC GAA AAT GAC ACT ATT TAT TGG GTG GAT ATG GGC CTA AGC ACC 6279 
His Ala Glu Asn Asp Thr He Tyr Trp Val Asp Met Gly Leu Ser Thr 
1935 1940 1945 

ATC AGC AGG GCC AAG CGT GAC CAG ACA TGG CGA GAG GAT GTG GTG ACC 6327 
He Ser Arg Ala Lys Arg Asp Gin Thr Trp Arg Glu Asp Val Val Thr 
1950 1955 1960 

AAC GGT ATT GGC CGT GTG GAG GGC ATC GCC GTG GAC TGG ATC GCA GGC 6375 
Asn Gly He Gly Arg Val Glu Gly He Ala Val Asp Trp He Ala Gly 
1965 1970 1975 

AAC ATA TAC TGG ACG GAC CAG GGC TTC GAT GTC ATC GAG GTT GCC CGG 6423 
Asn He Tyr Trp Thr Asp Gin Gly Phe Asp Val He Glu Val Ala Arg 
1980 1985 1990 

CTC AAT GGC TCT TTT CGT TAT GTG GTC ATT TCC CAG GGT CTG GAC AAG 6471 
Leu Asn Gly Ser Phe Arg Tyr Val Val lie Ser Gin Gly Leu Asp Lys 
1995 2000 2005 2010 

CCT CGG GCC ATC ACT GTC CAC CCA GAG AAG GGG TAC TTG TTC TGG ACC 6519 
Pro Arg Ala He Thr Val His Pro Glu Lys Gly Tyr Leu Phe Trp Thr 
2015 2020 2025 

GAG TGG GGT CAT TAC CCA CGT ATT GAG CGG TCT CGC CTT GAT GGC ACA 6567 
Glu Trp Gly His Tyr Pro Arg He Glu Arg Ser Arg Leu Asp Gly Thr 
2030 2035 2040 

GAG AGA GTG GTG TTG GTT AAT GTC AGC ATC AGC TGG CCC AAT GGC ATC 6615 
Glu Arg Val Val Leu Val Asn Val Ser lie Ser Trp Pro Asn Gly lie 
2045 2050 2055 

TCA GTA GAC TAT CAG GGC GGC AAG CTC TAC TGG TGT GAT GCT CGG ATG 6663 
Ser Val Asp Tyr Gin Gly Gly Lys Leu Tyr Trp Cys Asp Ala Arg Met 
2060 2065 2070 
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GAC AAG ATC GAG CGC ATC GAC CTG GAA ACG GGC GAG AAC CGG GAG GTG 6711 
Asp Lys He Glu Arg He Asp Leu Glu Thr Gly Glu Asn Arg Glu Val 
2075 2080 2085 2090 

GTC CTG TCC AGC AAT AAC ATG GAT ATG TTC TCC GTG TCC GTG TTT GAG 6759 
Val Leu Ser Ser Asn Asn Met Asp Met Phe Ser Val Ser Val Phe Glu 
2095 2100 2105 

GAC TTC ATC TAC TGG AGT GAC AGA ACT CAC GCC AAT GGC TCC ATC AAG 6807 
Asp Phe He Tyr Trp Ser Asp Arg Thr His Ala Asn Gly Ser He Lys 
2110 2115 2120 

CGC GGC TGC AAA GAC AAT GCT ACA GAC TCC GTG CCT CTG AGG ACA GGC 6855 
Arg Gly Cys Lys Asp Asn Ala Thr Asp Ser Val Pro Leu Arg Thr Gly 
2125 2130 2135 

ATT GGT GTT CAG CTT AAA GAC ATC AAG GTC TTC AAC AGG GAC AGG CAG 6903 
He Gly Val Gin Leu Lys Asp He Lys Val Phe Asn Arg Asp Arg Gin 
2140 2145 2150 

AAG GGT ACC AAT GTG TGC GCG GTA GCC AAC GGC GGG TGC CAG CAG CTC 6951 
Lys Gly Thr Asn Val Cys Ala Val Ala Asn Gly Gly Cys Gin Gin Leu 
2155 2160 2165 2170 

TGC TTG TAT CGG GGT GGC GGA CAG CGA GCC TGT GCC TGT GCC CAC GGG 6999 
Cys Leu Tyr Arg Gly Gly Gly Gin Arg Ala Cys Ala Cys Ala His Gly 
2175 2180 2185 

ATG CTG GCA GAA GAC GGG GCC TCA TGC CGA GAG TAC GCT GGC TAC CTG 7047 
Met Leu Ala Glu Asp Gly Ala Ser Cys Arg Glu Tyr Ala Gly Tyr Leu 
2190 2195 2200 

CTC TAC TCA GAG CGG ACC ATC CTC AAG AGC ATC CAC CTG TCG GAT GAG 7095 
Leu Tyr Ser Glu Arg Thr He Leu Lys Ser He His Leu Ser Asp Glu 
2205 2210 2215 

CGT AAC CTC AAC GCA CCG GTG CAG CCC TTT GAA GAC CCC GAG CAC ATG 7143 
Arg Asn Leu Asn Ala Pro Val Gin Pro Phe Glu Asp Pro Glu His Met 
2220 2225 2230 

AAA AAT GTC ATC GCC CTG GCC TTT GAC TAC CGA GCA GGC ACC TCC CCG 7191 
Lys Asn Val He Ala Leu Ala Phe Asp Tyr Arg Ala Gly Thr Ser Pro 
2235 2240 2245 2250 
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GGG ACC CCT AAC CGC ATC TTC TTC AGT GAC ATC CAC TTT GGG AAC ATC 7239 
Gly Thr Pro Asn Arg He Phe Phe Ser Asp lie His Phe Gly Asn lie 
2255 2260 2265 

CAG CAG ATC AAT GAC GAT GGC TCG GGC AGG ACC ACC ATC GTG GAA AAT 7287 
Gin Gin lie Asn Asp Asp Gly Ser Gly Arg Thr Thr He Val Glu Asn 
2270 2275 2280 

GTG GGC TCT GTG GAA GGC CTG GCC TAT CAC CGT GGC TGG GAC ACA CTG 7335 
Val Gly Ser Val Glu Gly Leu Ala Tyr His Arg Gly Trp Asp Thr Leu 
2285 2290 2295 

TAC TGG ACA AGC TAC ACC ACA TCC ACC ATC ACC CGC CAC ACC GTG GAC 7383 
Tyr Trp Thr Ser Tyr Thr Thr Ser Thr He Thr Arg His Thr Val Asp 
2300 2305 2310 



CAG ACT CGC CCA GGG GCC TTC GAG AGG GAG ACA GTC ATC ACC ATG TCC 7431 
Gin Thr Arg Pro Gly Ala Phe Glu Arg Glu Thr Val He Thr Met Ser 
2315 2320 2325 2330 

GGA GAC GAC CAC CCG AGA GCC TTT GTG CTG GAT GAG TGC CAG AAC CTG - . 7479 
Gly Asp Asp His Pro Arg Ala Phe Val Leu Asp Glu Cys Gin Asn Leu 
2335 2340 2345 



ATG TTC TGG ACC AAT TGG AAC GAG CTC CAT CCA AGC ATC ATG CGG GCA 7527 
Met Phe Trp Thr Asn Trp Asn Glu Leu His Pro Ser He Met Arg Ala 
2350 2355 2360 

GCC CTA TCC GGA GCC AAC GTC CTG ACC CTC ATT GAG AAG GAC ATC CGC 7575 
Ala Leu Ser Gly Ala Asn Val Leu Thr Leu He Glu Lys Asp He Arg 
2365 2370 2375 

ACG CCC AAT GGG TTG GCC ATC GAC CAC CGG GCG GAG AAG CTG TAC TTC 7623 
Thr Pro Asn Gly Leu Ala He Asp His Arg Ala Glu Lys Leu Tyr Phe 
2380 2385 2390 



TCG GAT GCC ACC TTG GAC AAG ATC GAG CGC TGC GAG TAC GAC GGC TCC 
Ser Asp Ala Thr Leu Asp Lys He Glu Arg Cys Glu Tyr Asp Gly Ser 
2395 2400 2405 2410 



7671 



CAC CGC TAT GTG ATC CTA AAG TCG GAG CCC GTC CAC CCC TTT GGG TTG 
His Arg Tyr Val He Leu Lys Ser Glu Pro Val His Pro Phe Gly Leu 
2415 2420 2425 



7719 
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GCG GTG TAC GGA GAG CAC ATT TTC TGG ACT GAC TGG GTG CGG CGG GCT 7767 
Ala Val Tyr Gly Glu His He Phe Trp Thr Asp Trp Val Arg Arg Ala 
2430 ■ 2435 2440 

GTG CAG CGA GCC AAC AAG TAT GTG GGC AGC GAC ATG AAG CTG CTT CGG 7815 
Val Gin Arg Ala Asn Lys Tyr Val Gly Ser Asp Met Lys Leu Leu Arg 
2445 2450 2455 

GTG GAC ATT CCC CAG CAA CCC ATG GGC ATC ATC GCC GTG GCC AAT GAC 7863 
Val Asp He Pro Gin Gin Pro Met Gly He He Ala Val Ala Asn Asp 
2460 2465 2470 

ACC AAC AGC TGT GAA CTC TCC CCC TGC CGT ATC AAC AAT GGA GGC TGC 7911 
Thr Asn Ser Cys Glu Leu Ser Pro Cys Arg He Asn Asn Gly Gly Cys 
2475 2480 2485 2490 

CAG GAT CTG TGT CTG CTC ACC CAC CAA GGC CAC GTC AAC TGT TCC TGT 7959 
Gin Asp Leu Cys Leu Leu Thr His Gin Gly His Val Asn Cys Ser Cys 
2495 2500 2505 

CGA GGG GGC CGG ATC CTC CAG GAG GAC TTC ACC TGC CGG GCT GTG AAC 8007 
Arg Gly Gly Arg He Leu Gin Glu Asp Phe Thr Cys Arg Ala Val Asn 
2510 2515 2520 

TCC TCT TGT CGG GCA CAA GAT GAG TTT GAG TGT GCC AAT GGG GAA TGT 8055 
Ser Ser Cys Arg Ala Gin Asp Glu Phe Glu Cys Ala Asn Gly Glu Cys 
2525 2530 2535 

ATC AGC TTC AGC CTC ACC TGT GAT GGC GTC TCC CAC TGC AAG GAC AAG 8103 
He Ser Phe Ser Leu Thr Cys Asp Gly Val Ser His Cys Lys Asp Lys 
2540 2545 2550 

TCC GAT GAG AAG CCC TCC TAC TGC AAC TCA CGC CGC TGC AAG AAG ACT 8151 
Ser Asp Glu Lys Pro Ser Tyr Cys Asn Ser Arg Arg Cys Lys Lys Thr 
2555 2560 2565 2570 

TTC CGC CAG TGT AAC AAT GGC CGC TGT GTA TCC AAC ATG CTG TGG TGC 8199 
Phe Arg Gin Cys Asn Asn Gly Arg Cys Val Ser Asn Met Leu Trp Cys 
2575 2580 2585 

AAT GGG GTG GAT TAC TGT GGG GAT GGC TCT GAT GAG ATA CCT TGC AAC 8247 
Asn Gly Val Asp Tyr Cys Gly Asp Gly Ser Asp Glu He Pro Cys Asn 
2590 2595 2600 
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MS ACT GCC TGT GGT GTG GGT GAG TTC CGC TGC CGG GAT GGG TCC TGC 8295 
Lys Thr Ala Cys Gly Val Gly Glu Phe Arg Cys Arg Asp Gly Ser Cys 
2605 2610 2615 

ATC GGG AAC TCC AGT CGC TGC AAC CAG TTT GTG GAT TGT GAG GAT GCC 8343 
He Gly Asn Ser Ser Arg Cys Asn Gin Phe Val Asp Cys Glu Asp Ala 
2620 2625 2630 

TCG GAT GAG ATG AAT TGC AGT GCC ACA GAC TGC AGC AGC TAT TTC CGC 8391 
Ser Asp Glu Met Asn Cys Ser Ala Thr Asp Cys Ser Ser Tyr Phe Arg 
2635 2640 " 2645 2650 

CTG GGC GTG AAA GGT GTC CTC TTC CAG CCG TGC GAG CGG ACA TCC CTG 8439 
Leu Gly Val Lys Gly Val Leu Phe Gin Pro Cys Glu Arg Thr Ser Leu 
2655 2660 2665 

TGC TAC GCA CCT AGC TGG GTG TGT GAT GGC GCC AAC GAC TGT GGA GAC 8487 
Cys Tyr Ala Pro Ser Trp Val Cys Asp Gly Ala Asn Asp Cys Gly Asp 
2670 2675 2680 

TAC AGC GAT GAA CGT GAC TGT CCA GGT GTG AAG CGC CCT AGG TGC CCG 8535 
Tyr Ser Asp Glu Arg Asp Cys Pro Gly Val Lys Arg Pro Arg Cys Pro 
2685 2690 2695 

CTC AAT TAC TTT GCC TGC CCC AGC GGG CGC TGT ATC CCC ATG AGC TGG 8583 
Leu Asn Tyr Phe Ala Cys Pro Ser Gly Arg Cys lie Pro Met Ser Trp 
2700 2705 2710 

ACG TGT GAC AAG GAG GAT GAC TGT GAG AAC GGC GAG GAT GAG ACC CAC 8631 
Thr Cys Asp Lys Glu Asp Asp Cys Glu Asn Gly Glu Asp Glu Thr His 
2715 2720 2725 2730 

TGC AAC AAG TTC TGC TCA GAG GCA CAG TTC GAG TGC CAG AAC CAC CGG 8679 
Cys Asn Lys Phe Cys Ser Glu Ala Gin Phe Glu Cys Gin Asn His Arg 
2735 2740 2745 

TGT ATC TCC AAG CAG TGG CTG TGT GAC GGT AGC GAT GAT TGC GGG GAT 8727 
Cys He Ser Lys Gin Trp Leu Cys Asp Gly Ser Asp Asp Cys Gly Asp 
2750 2755 2760 

GGC TCC GAT GAG GCA GCT CAC TGT GAA GGC AAG ACA TGT GGC CCC TCC 8775 
Gly Ser Asp Glu Ala Ala His Cys Glu Gly Lys Thr Cys Gly Pro Ser 
2765 2770 2775 
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TCC TTC TCC TGT CCC GGC ACC CAC GTG TGT GTC CCT GAG CGC TGG CTC 8823 
Sep Phe Ser Cys Pro Gly Thr His Val Cys Val Pro Glu Arg Trp Leu 
2780 2785 2790 

TGT GAT GGC GAC AAG GAC TGT ACC GAT GGC GCG GAT GAG AGT GTC ACT 8871 
Cys Asp Gly Asp Lys Asp Cys Thr Asp Gly Ala Asp Glu Ser Val Thr 
2795 2800 2805 2810 



GCT GGC TGC CTG TAC AAC AGC ACC TGT GAT GAC CGT GAG TTC ATG TGC 8919 
Ala Gly Cys Leu Tyr Asn Ser Thr Cys Asp Asp Arg Glu Phe Met Cys 
2815 2820 2825 

CAG AAC CGC TTG TGT ATT CCC AAG CAT TTC GTG TGC GAC CAT GAC CGT 8967 
Gin Asn Arg Leu Cys He Pro Lys His Phe Val Cys Asp His Asp Arg 
2830 2835 2840 

GAC TGT GCT GAT GGC TCT GAT GAA TCC CCT GAG TGT GAG TAC CCA ACC 9015 
Asp Cys Ala Asp Gly Ser Asp Glu Ser Pro Glu Cys Glu Tyr Pro Thr 
2845 2850 2855 

TGC GGG CCC AAT GAA TTC CGC TGT GCC AAT GGG CGT TGT CTG AGC TCC 9063 
Cys Gly Pro Asn Glu Phe Arg Cys Ala Asn Gly Arg Cys Leu Ser Ser 
2860 2865 2870 

CGT CAG TGG GAA TGT GAT GGG GAG AAT GAC TGT CAC GAC CAC AGC GAT 9111 
Arg Gin Trp Glu Cys Asp Gly Glu Asn Asp Cys His Asp His Ser Asp 
2875 2880 2885 2890 

GAG GCT CCC AAG AAC CCA CAC TGC ACC AGC CCA GAG CAC AAA TGC AAT 9159 
Glu Ala Pro Lys Asn Pro His Cys Thr Ser Pro Glu His Lys Cys Asn 
2895 2900 2905 

GCC TCA TCA CAG TTC CTG TGC AGC AGC GGG CGC TGC GTG GCT GAG GCG 9207 
Ala Ser Ser Gin Phe Leu Cys Ser Ser Gly Arg Cys Val Ala Glu Ala 
2910 2915 2920 

TTG CTC TGC AAC GGC CAG GAC GAC TGT GGG GAC GGT TCA GAC GAA CGC 9255 
Leu Leu Cys Asn Gly Gin Asp Asp Cys Gly Asp Gly Ser Asp Glu Arg 
2925 2930 2935 

GGG TGC CAT GTC AAC GAG TGT CTC AGC CGC AAG CTC AGT GGC TGC AGT 9303 
Gly Cys His Val Asn Glu Cys Leu Ser Arg Lys Leu Ser Gly Cys Ser 
2940 2945 2950 
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CAG GAC T6C GAG GAC CTC AAG ATA GGC TTT AAG TGC CGC TGT CGC CCG 9351 
Gin Asp Cys Glu Asp Leu Lys He Gly Phe Lys Cys Arg Cys Arg Pro 

2955 2960 2965 2970 ] 

GGC TTC CGG CTA AAG GAC GAT GGC AGG ACC TGT GCC GAC CTG GAT GAG 9399 | 
Gly Phe Arg Leu Lys Asp Asp Gly Arg Thr Cys Ala Asp Leu Asp Glu 
2975 2980 2985 

s 

TGC AGC ACC ACC TTC CCC TGC AGC CAG CTC TGC ATC AAC ACC CAC GGA 9447 
Cys Ser Thr Thr Phe Pro Cys Ser Gin Leu Cys He Asn Thr His Gly 

2990 2995 3000 J 

AGT TAC AAG TGT CTG TGT GTG GAG GGC TAT GCA CCC CGT GGC GGT GAC 9495 
Ser Tyr Lys Cys Leu Cys Val Glu Gly Tyr Ala Pro Arg Gly Gly Asp 
3005 3010 3015 

CCC CAC AGC TGC AAA GCT GTG ACC GAT GAG GAG CCA TTT CTC ATC TTT 9543 
Pro His Ser Cys Lys Ala Val Thr Asp Glu Glu Pro Phe Leu He Phe 
3020 3025 3030 

GCC AAC CGG TAC TAC CTG CGG AAG CTC AAC CTG GAC GGC TCC AAC TAC 9591 
Ala Asn Arg Tyr Tyr Leu Arg Lys Leu Asn Leu Asp Gly Ser Asn Tyr 
3035 3040 3045 3050 

ACA CTG CTT AAG CAG GGC CTG AAC AAT GCG GTC GCC TTG GCA TTT GAC 9639 
Thr Leu Leu Lys Gin Gly Leu Asn Asn Ala Val Ala Leu Ala Phe Asp 
3055 3060 3065 

TAC CGA GAG CAG ATG ATC TAC TGG ACG GGC GTG ACC ACC CAG GGC AGC 9687 
Tyr Arg Glu Gin Met He Tyr Trp Thr Gly Val Thr Thr Gin Gly Ser 
3070 3075 3080 



ATG ATT CGC AGG ATG CAC CTC AAC GGC AGC AAC GTG CAG GTT CTG CAC 9735 
Met He Arg Arg Met His Leu Asn Gly Ser Asn Val Gin Val Leu His 
3085 3090 3095 

CGG ACG GGC CTT AGT AAC CCA GAT GGG CTC GCT GTG GAC TGG GTG GGT 9783 
Arg Thr Gly Leu Ser Asn Pro Asp Gly Leu Ala Val Asp Trp Val Gly 
3100 3105 3110 

GGC AAC CTG TAC TGG TGT GAC AAG GGC AGA GAT ACC ATT GAG GTG TCC 9831 
Gly Asn Leu Tyr Trp Cys Asp Lys Gly Arg Asp Thr He Glu Val Ser 
3115 3120 3125 3130 
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MS CTT AAC GGG GCC TAT C6G ACA GTG CTG GTC AGC TCT GGC CTC CGG 9879 
Lys Leu Asn Gly Ala Tyr Arg Thr Val Leu Val Ser Ser Gly Leu Arg 
3135 3140 3145 

GAG CCC AGA GCT CTG GTA GTG GAT GTA CAG AAT GGG TAC CTG TAC TGG 9927 
Glu Pro Arg Ala Leu Val Val Asp Val Gin Asn Gly Tyr Leu Tyr Trp 
3150 3155 3160 

ACA GAC TGG GGT GAC CAC TCA CTG ATC GGC CGG ATT GGC ATG GAT GGA 9975 
Thr Asp Trp Gly Asp His Ser Leu He Gly Arg lie Gly Met Asp Gly 
3165 3170 3175 

TCT GGC CGC AGC ATC ATC GTG GAC ACT AAG ATC ACA TGG CCC AAT GGC 10023 
Ser Gly Arg Ser He lie Val Asp Thr Lys He Thr Trp Pro Asn Gly 
3180 3185 3190 

CTG ACC GTG GAC TAC GTC ACG GAA CGC ATC TAC TGG GCT GAC GCC CGT 10071 
Leu Thr Val Asp Tyr Val Thr Glu Arg lie Tyr Trp Ala Asp Ala Arg 
3195 3200 3205 3210 

GAG GAC TAC ATC GAG TTC GCC AGC CTG GAT GGC TCC AAC CGT CAC GTT 10119 
Glu Asp Tyr He Glu Phe Ala Ser Leu Asp Gly Ser Asn Arg His Val 
3215 3220 3225 

GTG CTG AGC CAA GAC ATC CCA CAC ATC TTT GCG CTG ACC CTA TTT GAA 10167 
Val Leu Ser Gin Asp He Pro His He Phe Ala Leu Thr Leu Phe Glu 
3230 3235 3240 

GAC TAC GTC TAC TGG ACA GAC TGG GAA ACG AAG TCC ATC AAC CGG GCC 10215 
Asp Tyr Val Tyr Trp Thr Asp Trp Glu Thr Lys Ser He Asn Arg Ala 
3245 3250 3255 

CAC AAG ACC ACG GGT GCC AAC AAA ACA CTC CTC ATC AGC ACC CTG CAC 10263 
His Lys Thr Thr Gly Ala Asn Lys Thr Leu Leu He Ser Thr Leu His 
3260 3265 3270 

CGG CCC ATG GAC TTA CAT GTA TTC CAC GCC CTG CGC CAG CCA GAT GTG 10311 
Arg Pro Met Asp Leu His Val Phe His Ala Leu Arg Gin Pro Asp Val 
3275 3280 3285 3290 

CCC AAT CAC CCC TGC AAA GTC AAC AAT GGT GGC TGC AGC AAC CTG TGC 10359 
Pro Asn His Pro Cys Lys Val Asn Asn Gly Gly Cys Ser Asn Leu Cys 
3295 3300 3305 
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CTG CTG TCC CCT GGG GGT GGT CAC AAG TGC GCC TGC CCC ACC AAC TTC . 10407 
Leu Leu Ser Pro Gly Gly Gly His Lys Cys Ala Cys Pro Thr Asn Phe 
3310 3315 3320 



TAT CTG GGT GGC GAT GGC CGT ACC TGT GTG TCC AAC TGC ACA GCA AGC 10455 
Tyr Leu Gly Gly Asp Gly Arg Thr Cys Val Ser Asn Cys Thr Ala Ser 
3325 3330 3335 

CAG TTT GTG TGC AAA AAT GAC AAG TGC ATC CCC TTC TGG TGG AAG TGT 10503 
Gin Phe Val Cys Lys Asn Asp Lys Cys He Pro Phe Trp Trp Lys Cys 
3340 3345 3350 

GAC ACG GAG GAC GAC TGT GGG GAT CAC TCA GAC GAG CCT CCA GAC TGT 10551 
Asp Thr Glu Asp Asp Cys Gly Asp His Ser Asp Glu Pro Pro Asp Cys 
3355 3360 3365 3370 

CCC GAG TTC AAG TGC CGC CCA GGC CAG TTC CAG TGC TCC ACC GGC ATC 10599 
Pro Glu Phe Lys Cys Arg Pro Gly Gin Phe Gin Cys Ser Thr Gly He 
3375 3380 3385 

TGC ACC AAC CCT GCC TTC ATC TGT GAT GGG GAC AAT GAC TGC CAA GAC 10647 
Cys Thr Asn Pro Ala Phe He Cys Asp Gly Asp Asn Asp Cys Gin Asp 
3390 3395 3400 

AAT AGT GAC GAG GCC AAT TGC GAC ATT CAC GTC TGC TTG CCC AGC CAA 10695 
Asn Ser Asp Glu Ala Asn Cys Asp He His Val Cys Leu Pro Ser Gin 
3405 3410 3415 

TTC AAG TGC ACC AAC ACC AAC CGC TGC ATT CCT GGC ATC TTC CGT TGC 10743 
Phe Lys Cys Thr Asn Thr Asn Arg Cys He Pro Gly He Phe Arg Cys 
3420 3425 3430 



AAT GGG CAG GAC AAC TGC GGG GAC GGC GAG GAT GAG CGG GAT TGC CCT 10791 
Asn Gly Gin Asp Asn Cys Gly Asp Gly Glu Asp Glu Arg Asp Cys Pro 
3435 3440 3445 3450 

GAG GTG ACC TGC GCC CCC AAC CAG TTC CAG TGC TCC ATC ACC AAG CGC 10839 
Glu Val Thr Cys Ala Pro Asn Gin Phe Gin Cys Ser He Thr Lys Arg 
3455 3460 3465 

TGC ATC CCT CGC GTC TGG GTC TGT GAC AGG GAT AAT CAC TGT GTG GAC 10887 
Cys He Pro Arg Val Trp Val Cys Asp Arg Asp Asn His Cys Val Asp 
3470 3475 3480 
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GGC AGT GAT GAG CCT GCC AAC TGT ACC CAA ATG ACC TGT GGA GTG GAT 10935 
Gly Ser Asp Glu Pro Ala Asn Cys Thr Gin Met Thr Cys Gly Val Asp 
3485 3490 3495 

GAG TTC CGC TGC AAG GAT TCT GGC CGC TGC ATC CCC GCG CGC TGG AAG 10983 
Glu Phe Arg Cys Lys Asp Ser Gly Arg Cys He Pro Ala Arg Trp Lys 
3500 3505 3510 

TGT GAC GGA GAA GAT GAC TGT GGG GAT GGT TCA GAT GAG CCC AAG.GAA 11031 
Cys Asp Gly Glu Asp Asp Cys Gly Asp Gly Ser Asp Glu Pro Lys Glu 
3515 3520 3525 3530 

GAG TGT GAT GAG CGC ACC TGT GAG CCA TAC CAG TTC CGC TGC AAA AAC 11079 
Glu Cys Asp Glu Arg Thr Cys Glu Pro Tyr Gin Phe Arg Cys Lys Asn 
3535 ' 3540 3545 

AAC CGC TGT GTC CCA GGC CGT TGG CAA TGT GAC TAC GAC AAC GAC TGC 11127 
Asn Arg Cys Val Pro Gly Arg Trp Gin Cys Asp Tyr Asp Asn Asp Cys 
3550 3555 3560 

GGA GAT AAC TCG GAC GAG GAG AGC TGC ACA CCT CGG CCC TGC TCT GAG 11175 
Gly Asp Asn Ser Asp Glu Glu Ser Cys Thr Pro Arg Pro Cys Ser Glu 
3565 3570 3575 

AGT GAG TTT TTC TGT GCC AAT GGC CGC TGC ATC GCT GGG CGC TGG AAG 11223 
Ser Glu Phe Phe Cys Ala Asn Gly Arg Cys He Ala Gly Arg Trp Lys 
3580 3585 3590 

TGT GAT GGG GAC CAT GAC TGT GCC GAC GGC TCA GAC GAG AAA GAC TGC 11271 
Cys Asp Gly Asp His Asp Cys Ala Asp Gly Ser Asp Glu Lys Asp Cys 
3595 3600 3605 3610 

ACC CCC CGC TGT GAT ATG GAC CAG TTC CAG TGC AAG AGT GGC CAC TGC 11319 
Thr Pro Arg Cys Asp Met Asp Gin Phe Gin Cys Lys Ser Gly His Cys 
3615 3620 3625 

ATC CCC CTG CGC TGG CCG TGT GAC GCG GAT GCT GAC TGT ATG GAC GGC 11367 
He Pro Leu Arg Trp Pro Cys Asp Ala Asp Ala Asp Cys Met Asp Gly 
3630 3635 3640 

AGT GAC GAG GAA GCC TGT GGC ACT GGG GTG AGG ACC TGC CCA TTG GAT 11415 
Ser Asp Glu Glu Ala Cys Gly Thr Gly Val Arg Thr Cys Pro Leu Asp 
3645 3650 3655 
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GAG TTT CAA TGT AAC AAC ACC TTG TGC AAG CCG CTG GCC TGG AAG TGT 11463 
Gin Phe Gin Cys Asn Asn Thr Leu Cys Lys Pro Leu Ala Trp Lys Cys 
3660 3665 3670 

GAT GGA GAG GAC GAC TGT GGG GAC AAC TCA GAT GAG AAC CCC GAG GAA 11511 
Asp Gly Glu Asp Asp Cys Gly Asp Asn Ser Asp Glu Asn Pro Glu Glu 
3675 3680 3685 3690 

TGC GCC CGG TTC ATC TGC CCT CCC AAC CGG CCT TTC CGC TGC AAG AAT 11559 
Cys Ala Arg Phe He Cys Pro Pro Asn Arg Pro Phe Arg Cys Lys Asn 
3695 3700 3705 

GAC CGA GTC TGC CTG TGG ATT GGG CGC CAG TGT GAT GGC GTG GAC AAC 11607 
Asp Arg Val Cys Leu Trp He Gly Arg Gin Cys Asp Gly Val Asp Asn 
3710 3715 3720 

TGT GGA GAT GGG ACT GAC GAG GAG GAC TGT GAG CCC CCC ACG GCC CAG 11655 
Cys Gly Asp Gly Thr Asp Glu Glu Asp Cys Glu Pro Pro Thr Ala Gin 
3725 3730 3735 

AAC CCC CAC TGC AAA GAC AAG AAG GAG TTC CTG TGC CGA AAC CAG CGC 11703 
Asn Pro His Cys Lys Asp Lys Lys Glu Phe Leu Cys Arg Asn Gin Arg 
3740 3745 3750 

TGT CTA TCA TCC TCC CTG CGC TGT AAC ATG TTC GAT GAC TGC GGC GAT 11751 
Cys Leu Ser Ser Ser Leu Arg Cys Asn Met Phe Asp Asp Cys Gly Asp 
3755 3760 3765 3770 

GGC TCC GAT GAA GAA GAT TGC AGC ATC GAC CCC AAG CTG ACC AGC TGT 11799 
Gly Ser Asp Glu Glu Asp Cys Ser He Asp Pro Lys Leu Thr Ser Cys 
3775 3780 3785 

GCC ACC AAT GCC AGC ATG TGT GGG GAC GAA GCT CGT TGT GTG CGC ACT 11847 
Ala Thr Asn Ala Ser Met Cys Gly Asp Glu Ala Arg Cys Val Arg Thr 
3790 3795 3800 

GAG AAA GCT GCC TAC TGT GCC TGC CGC TCG GGC TTC CAT ACT GTG CCG 11895 
Glu Lys Ala Ala Tyr Cys Ala Cys Arg Ser Gly Phe His Thr Val Pro 
3805 3810 3815 

GGC CAG CCC GGA TGC CAG GAC ATC AAC GAG TGC CTG CGC TTT GGT ACC 11943 
Gly Gin Pro Gly Cys Gin Asp He Asn Glu Cys Leu Arg Phe Gly Thr 
3820 3825 3830 
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TGC TCT CAG CTC TGG AAC AAA GCC AAG GGA GGC CAC CTC TGC AGC TGT 11991 
Cys Ser Gin Leu Trp Asn Lys Pro Lys Gly Gly His Leu Cys Ser Cys 
3835 3840 3845 3850 

GCC CGC AAC TTC ATG AAG ACA CAC AAC ACC TGC AAA GCT GAA GGC TCC 12039 
Ala Arg Asn Phe Met Lys Thr His Asn Thr Cys Lys Ala Glu Gly Ser 
3855 3860 3865 

GAG TAC CAG GTG CTA TAC ATC GCG GAT GAC AAC GAG ATC CGC AGC TTG 12087 
Glu Tyr Gin Val Leu Tyr lie Ala Asp Asp Asn Glu He Arg Ser Leu 
3870 3875 3880 

TTC CCG GGC CAC CCC CAC TCA GCC TAC GAG CAG ACA TTC CAG GGC GAT 12135 
Phe Pro Gly His Pro His Ser Ala Tyr Glu Gin Thr Phe Gin Gly Asp 
3885 3890 3895 

GAG AGT GTC CGC ATA GAT GCC ATG GAT GTC CAT GTC AAG GCC GGC CGT 12183 
Glu Ser Val Arg He Asp Ala Met Asp Val His Val Lys Ala Gly Arg 
3900 3905 3910 

GTC TAC TGG ACT AAC TGG CAC ACG GGC ACA ATC TCC TAC AGG AGC CTG 12231 
Val Tyr Trp Thr Asn Trp His Thr Gly Thr He Ser Tyr Arg Ser Leu 
3915 3920 3925 3930 

CCC CCT GCC GCC CCT CCT ACC ACT TCC AAC CGC CAC CGG AGG CAG ATC 12279 
Pro Pro Ala Ala Pro Pro Thr Thr Ser Asn Arg His Arg Arg Gin He 
3935 3940 3945 

GAC CGG GGT GTC ACC CAC CTC AAT ATT TCA GGG CTG AAG ATG CCG AGG 12327 
Asp Arg Gly Val Thr His Leu Asn lie Ser Gly Leu Lys Met Pro Arg 
3950 3955 3960 

GGT ATC GCT ATC GAC TGG GTG GCC GGG AAT GTG TAC TGG ACC GAT TCC 12375 
Gly He Ala He Asp Trp Val Ala Gly Asn Val Tyr Trp Thr Asp Ser 
3965 3970 3975 

GGC CGA GAC GTG ATT GAG GTG GCG CAA ATG AAG GGC GAG AAC CGC AAG 12423 
Gly Arg Asp Val He Glu Val Ala Gin Met Lys Gly Glu Asn Arg Lys 
3980 3985 3990 

ACG CTC ATC TCG GGC ATG ATT GAT GAG CCC CAT GCC ATC GTG GTG GAC 12471 
Thr Leu He Ser Gly Met He Asp Glu Pro His Ala He Val Val Asp 
3995 4000 4005 4010 
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CCT CTG A6G GGC ACC ATG TAC TGG TCA GAC TGG GGG AAC CAC CCC AAG 12519 
Pro Leu Arg Gly Thr Met Tyr Trp Ser Asp Trp Gly Asn His Pro Lys 
4015 4020 4025 

ATT GAA ACA GCA GCG ATG GAT GGC ACC CTT CGG GAG ACT CTC GTG CAA 12567 
He Glu Thr Ala Ala Met Asp Gly Thr Leu Arg Glu Thr Leu Val Gin 
4030 4035 4040 

GAC AAC ATT CAG TGG CCT ACA GGG CTG GCT GTG GAC TAT CAC AAT GAA 12615 
Asp Asn He Gin Trp Pro Thr Gly Leu Ala Val Asp Tyr His Asn Glu 
4045 4050 4055 

CGG CTC TAC, TGG GCA GAT GCC AAG CTT TCG GTC ATC GGC AGC ATC CGG 12663 
Arg Leu Tyr Trp Ala Asp Ala Lys Leu Ser Val He Gly Ser He Arg 
4060 4065 4070 

CTC AAC GGC ACT GAC CCC ATT GTG GCT GCT GAC AGC AAA CGA GGC CTA 12711 
Leu Asn Gly Thr Asp Pro He Val Ala Ala Asp Ser Lys Arg Gly Leu 
4075 4080 4085 4090 

AGT CAC CCC TTC AGC ATC GAT GTG TFT GAA GAC TAC ATC TAC GGA GTC 12759 
Ser His Pro Phe Ser He Asp Val Phe Glu Asp Tyr He Tyr Gly Val 
4095 4100 4105 

ACT TAC ATC AAT AAT CGT GTC TTC AAG ATC CAC AAG TTT GGA CAC AGC 12807 
Thr Tyr lie Asn Asn Arg Val Phe Lys lie His Lys Phe Gly His Ser 
4110 4115 4120 

CCC TTG TAC AAC CTA ACT GGG GGC CTG AGC CAT GCC TCT GAT GTA GTC 12855 
Pro Leu Tyr Asn Leu Thr Gly Gly Leu Ser His Ala Ser Asp Val Val 
4125 4130 4135 

CTT TAC CAT CAA CAC AAG CAG CCT GAA GTG ACC AAC CCC TGT GAC CGC 12903 
Leu Tyr His Gin His Lys Gin Pro Glu Val Thr Asn Pro Cys Asp Arg 
4140 4145 4150 

AAG AAA TGC GAA TGG CTG TGT CTG CTG AGC CCC AGC GGG CCT GTC TGC 12951 
Lys Lys Cys Glu Trp Leu Cys Leu Leu Ser Pro Ser Gly Pro Val Cys 
4155 4160 4165 4170 

ACC TGT CCC AAT GGA AAG AGG CTG GAT AAT GGC ACC TGT GTG CCT GTG 12999 
Thr Cys Pro Asn Gly Lys Arg Leu Asp Asn Gly Thr Cys Val Pro Val 
4175 4180 4185 
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CCC TCT CCA ACA CCC CCT CCA GAT GCC CCT AGG CCT GGA ACC TGC ACT 13047 
Pro Ser Pro Thr Pro Pro Pro Asp Ala Pro Arg Pro Gly Thr Cys Thr 
4190 4195 4200 

CTG CAG TGC TTC AAT GGT GGT AGT TGT TTC CTC AAC GCT CGG AGG CAG 13095 
Leu Gin Cys Phe Asn Gly Gly Ser Cys Phe Leu Asn Ala Arg Arg Gin 
4205 4210 4215 

CCC AAG TGC CGT TGC CAG CCC CGT TAC ACA GGC GAT MG TGT GAG CTG 13143 
Pro Lys Cys Arg Cys Gin Pro Arg Tyr Thr Gly Asp Lys Cys Glu Leu 
4220 4225 4230 

GAT CAG TGC TGG GAA TAC TGT CAC AAC GGA GGC ACC TGT GCG GCT TCC 13191 
Asp Gin Cys Trp Glu Tyr Cys His Asn Gly Gly Thr Cys Ala Ala Ser 
4235 4240 4245 4250 

CCA TCT GGC ATG CCC ACG TGC CGC TGT CCC ACT GGC TTC ACG GGC CCC 13239 
Pro Ser Gly Met Pro Thr Cys Arg Cys Pro Thr Gly Phe Thr Gly Pro 
4255 4260 4265 

AAA TGC ACC GCA CAG GTG TGT GCA GGC TAC TGC TCT AAC AAC AGC ACC 13287 
Lys Cys Thr Ala Gin Val Cys Ala Gly Tyr Cys Ser Asn Asn Ser Thr 
4270 4275 4280 

TGC ACC GTC AAC CAG GGC AAC CAG CCC CAG TGC CGA TGT CTA CCT GGC 13335 
Cys Thr Val Asn Gin Gly Asn Gin Pro Gin Cys Arg Cys Leu Pro Gly 
4285 4290 4295 

TTC CTG GGC GAC CGT TGC CAG TAC CGG CAG TGC TCT GGC TTC TGT GAG 13383 
Phe Leu Gly Asp Arg Cys Gin Tyr Arg Gin Cys Ser Gly Phe Cys Glu 
4300 4305 4310 

AAC TTT GGC ACC TGT CAG ATG GCT GCT GAT GGC TCC CGA CAA TGT CGC 13431 
Asn Phe Gly Thr Cys Gin Met Ala Ala Asp Gly Ser Arg Gin Cys Arg 
4315 4320 4325 4330 

TGC ACC GTC TAC TTT GAG GGA CCA AGG TGT GAG GTG AAC AAG TGT AGT 13479 
Cys Thr Val Tyr Phe Glu Gly Pro Arg Cys Glu Val Asn Lys Cys Ser 
4335 4340 4345 

CGC TGT CTC CAA GGC GCC TGT GTG GTC AAT AAG CAG ACC GGA GAT GTC 13527 
Arg Cys Leu Gin Gly Ala Cys Val Val Asn Lys Gin Thr Gly Asp Val 
4350 4355 4360 
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ACA TGC AAC TGC ACT GAT GGC CGG GTA GCC CCC AGT TGT CTC ACC TGC 13575 
Thr Cys Asn Cys Thr Asp Gly Arg Val Ala Pro Ser Cys Leu Thr Cys 
4365 4370 4375 

ATC GAT CAC TGT AGC AAT GGT GGG TCC TGC ACC ATG AAC AGC AAG ATG 13623 
He Asp His Cys Ser Asn Gly Gly Ser Cys Thr Met Asn Ser Lys Met 
4380 4385 4390 

ATG CCT GAG TGC CAG TGC CCG CCC CAT ATG ACA GGA CCC CGG TGC CAG 13671 
Met Pro Glu Cys Gin Cys Pro Pro His Met Thr Gly Pro Arg Cys Gin 
4395 4400 4405 4410 

GAG CAG GTT GTT AGT CAG CAA CAG CCT GGG CAT ATG GCC TCC ATC CTG 13719 
Glu Gin Val Val Ser Gin Gin Gin Pro Gly His Met Ala Ser lie Leu 
4415 4420 4425 

ATC CCT CTG CTG CTG CTT CTC CTG CTG CTT CTG GTG GCT GGC GTG GTG 13767 
He Pro Leu Leu Leu Leu Leu Leu Leu Leu Leu Val Ala Gly Val Val 
4430 4435 4440 

TTC TGG TAT AAG CGG CGA GTC CGA GGG GCT AAG GGC TTC CAG CAC CAG 13815 
Phe Trp Tyr Lys Arg Arg Val Arg Gly Ala Lys Gly Phe Gin His Gin 
4445 4450 . 4455 

CGG ATG ACC AAT GGG GCC ATG AAT GTG GAA ATT GGA AAC CCT ACC TAC 13863 
Arg Met Thr Asn Gly Ala Met Asn Val Glu He Gly Asn Pro Thr Tyr 
4460 4465 4470 



AAG ATG TAT GAA GGT GGA GAG CCC GAT GAT GTC GGG GGC CTA CTG GAT 13911 
Lys Met Tyr Glu Gly Gly Glu Pro Asp Asp Val Gly Gly Leu Leu Asp 
4475 4480 4485 4490 

GCT GAT TTT GCC CTT GAC CCT GAC AAG CCT ACC AAC TTC ACC AAC CCA 13959 
Ala Asp Phe Ala Leu Asp Pro Asp Lys Pro Thr Asn Phe Thr Asn Pro 
4495 4500 4505 

GTG TAT GCC ACG CTC TAC ATG GGG GGC CAC GGC AGC CGC CAT TCC CTG 14007 
Val Tyr Ala Thr Leu Tyr Met Gly Gly His Gly Ser Arg His Ser Leu 
4510 4515 4520 - 

GCC AGC ACG GAC GAG AAG CGA GAA CTG CTG GGC CGG GGA CCT GAA GAC 14055 
Ala Ser Thr Asp Glu Lys Arg Glu Leu Leu Gly Arg Gly Pro Glu Asp 
4525 4530 4535 
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GAG ATA GGA GAT CCC TTG GCA TAGGGCCCTG CCCCGACGGA TGTCCCCAGA AAGC 14110 
CCCCTGCCAC ATGAGTCTTT CAATGAACCC CCTCCCCAGC CGGCCCTTCT CCGGCCCTGC 14170 
Giu lie Gly Asp Pro Leu Ala 



CGGGTGTACA MTGTAAAAA TGAAGGAATT ACTTTTTATA TGTGAGCGAG CAAGCGAGCA 14230 

AGCACAGTAT TATCTCTTTG CATTTCCTTC CTGCCTGCTC CTCAGTATCC CCCCCATGCT 14290 

GCCTTGAGGG GGCGGGGAGG GCTTTGTGGC TCAAAGGTAT GAAGGAGTCC ACATGTTCCC 14350 

TACCGAGCAT ACCCCTGGAA GCCTGGCGGC ACGGCCTCCC CACCACGCCT GTGCAAGACA 14410 

CTCAACGGGG CTCCGTGTCC CAGCTTTCCT TTCCTTGGCT CTCTGGGGTT AGTTCAGGGG 14470 

AGGTGGAGTC CTCTGCTGAC CCTGTCTGGA AGATTTGGCT CTAGCTGAGG AAGGAGTCTT 14530 

TTAGTTGAGG GAAGTCACCC CAAACCCCAG CTCCCACTTT CAGGGGCACC TCTCAGATGG 14590 

CCATGCTCAG TATCCCTTCC AGACAGGCCC TCCCCTCTCT AGCGCCCCCT CTGTGGCTCC 14650 

TAGGGCTGAA CACATTCTTT GGTAACTGTC CCCCAAGCCT CCCATCCCCC TGAGGGCCAG 14710 

GAAGAGTCGG GGCACACCAA GGAAGGGCAA GCGGGCAGCC CCATTTTGGG GACGTGAACG 14770 

TTTTAATAAT TTTTGCTGAA TTCCTTTACA ACTAAATAAC ACAGATATTG TTATAAATAA 14830 
AATTGTAAAA AAAAAAAAA 



4540 



4545 
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Met Leu Thr Pro Pro Leu Leu Leu Leu Val Pro Leu Leu Ser Ala Leu 
15 10 15 

Val Ser Gly Ala Thr Met Asp Ala Pro Lys Thr Cys Ser Pro Lys Gin 

20 25 30 

Phe Ala Cys Arg Asp Gin He Thr Cys He Ser Lys Gly Trp Arg Cys 

35 40 45 

Asp Gly Glu Arg Asp. Cys Pro Asp Gly Ser Asp Glu Ala Pro Glu He 

50 55 60 

Cys Pro Gin Ser Lys Ala Gin Arg Cys Pro Pro Asn Glu His Ser Cys 
65 70 75 80 

Leu Gly Thr Glu Leu Cys Val Pro Met Ser Arg Leu Cys Asn Gly He 

85 90 95 

Gin Asp Cys Met Asp Gly Ser Asp Glu Gly Ala His Cys Arg Glu Leu 

100 105 110 

Arg Ala Asn Cys Ser Arg Met Gly Cys Gin His His Cys Val Pro Thr 

115 ^ 120 125 

Pro Ser Gly Pro Thr Cys Tyr Cys Asn Ser Ser Phe Gin Leu Glu Ala 

130 135 140 

Asp Gly Lys Thr Cys Lys Asp Phe Asp Glu Cys Ser Val Tyr Gly Thr 
145 150 155 160 

Cys Ser Gin Leu Cys Thr Asn Thr Asp Gly Ser Phe Thr Cys Gly Cys 

165 170 175 

Val Glu Gly Tyr Leu Leu Gin Pro Asp Asn Arg Ser Cys Lys Ala Lys 

180 185 190 

Asn Glu Pro Val Asp Arg Pro Pro Val Leu Leu He Ala Asn Ser Gin 

195 200 205 

Asn He Leu Ala Thr Tyr Leu Ser Gly Ala Gin Val Ser Thr lie Thr 

210 215 220 

Pro Thr Ser Thr Arg Gin Thr Thr Ala Met Asp Phe Ser Tyr Ala Asn 
225 230 235 240 

Glu Thr Val Cys Trp Val His Val Gly Asp Ser Ala Ala Gin Thr Gin 

245 250 255 

Leu Lys Cys Ala Arg Met Pro Gly Leu Lys Gly Phe Val Asp Glu His 

260 265 270 

Thr He Asn He Ser Leu Ser Leu His His Val Glu Gin Met Ala He 

275 280 285 

Asp Trp Leu Thr Gly Asn Phe Tyr Phe Val Asp Asp He Asp Asp Arg 

290 295 300 

He Phe Val Cys Asn Arg Asn Gly Asp Thr Cys Val Thr Leu Leu Asp 
305 310 315 320 
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lie Asii Trp Arg Cys Asp Asn Asp Asn Asp Cys Gly Asp Asn Ser Asp 

995 1000 1005 

Glu Ala Gly Cys Ser His Ser Cys Ser Ser Thr Gin Phe Lys Cys Asn 

1010 1015 1020 

Ser Gly Arg Cys He Pro Glu His Trp Thr Cys Asp Gly Asp Asn Asp 
025 1030 1035 . 1040 

Cys Gly Asp Tyr Ser Asp Glu Thr His Ala Asn Cys Thr Asn Gin Ala 

1045 1050 1055 

Thr Arg Pro Pro Gly Gly Cys His Ser Asp Glu Phe Gin Cys Pro Leu 

1060 1065 1070 

Asp Gly Leu Cys He Pro Leu Arg Trp Arg Cys Asp Gly Asp Thr Asp 

1075 1080 1085 

Cys Met Asp Ser Ser Asp Glu Lys Ser Cys Glu Gly Val Thr His Val 

1090 1095 1100 

Cys Asp Pro Asn Val Lys Phe Gly Cys Lys Asp Ser Ala Arg Cys He 
105 1110 1115 1120 

Ser Lys Ala Trp Val Cys Asp Gly Asp Ser Asp Cys Glu Asp Asn Ser 

1125 1130 1135 

Asp Glu Glu Asn Cys Glu Ala Leu Ala Cys Arg Pro Pro Ser His Pro 

1140 1145 1150 

Cys Ala Asn Asn Thr Ser Val Cys Leu Pro Pro Asp Lys Leu Cys Asp 

1155 1160 t v, 1165 

Gly Lys Asp Asp Cys Gly Asp Gly Ser Asp Glu Gly Glu Leu Cys Asp 

1170 1175 1180 

Gin Cys Ser Leu Asn Asn Gly Gly Cys Ser His Asn Cys Ser Val Ala 
185 1190 1195 1200 

Pro Gly Glu Gly He Val Cys Ser Cys Pro Leu Gly Met Glu Leu Gly 

1205 1210 1215 

Ser Asp Asn His Thr Cys Gin lie Gin Ser Tyr Cys Ala Lys His Leu 

1220 1225 1230 

Lys Cys Ser Gin Lys Cys Asp Gin Asn Lys Phe Ser Val Lys Cys Ser 

1235 1240 1245 

Cys Tyr Glu Gly Trp Val Leu Glu Pro Asp Gly Glu Thr Cys Arg Ser 

1250 1255 1260 

Leu Asp Pro Phe Lys Leu Phe lie He Phe Ser Asn Arg His Glu lie 
265 1270 1275 1280 

Arg Arg lie Asp Leu His Lys Gly Asp Tyr Ser Val Leu Val Pro Gly 

1285 1290 1295 

Leu Arg Asn Thr lie Ala Leu Asp Phe His Leu Ser Gin Ser Ala Leu 

1300 1305 1310 

Tyr Trp Thr Asp Ala Val Glu Asp Lys He Tyr Arg Gly Lys Leu Leu 



1315 
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Ala Leu Thr Ser Phe Glu Val Val lie Gin Tyr Gly Leu 
1335 1340 



Ala Thr Pro Glu Gly Leu Ala Val Asp Trp lie Ala Gly Asn He Tyr 
345 1350 1355 1360 

Trp Val Glu Ser Asn Leu Asp Gin lie Glu Val Ala Lys Leu Asp Gly 

1365 1370 1375 

Thr Leu Arg Thr Thr Leu Leu Ala Gly Asp He Glu His Pro Arg Ala 

1380 1385 1390 

He Ala Leu Asp Pro Arg Asp Gly lie Leu Phe Trp Thr Asp Trp Asp 

1395 1400 1405 

Ala Ser Leu Pro Arg lie Glu Ala Ala Ser Met Ser Gly Ala Gly Arg 

1410 1415 1420 

Arg Thr lie His Arg Glu Thr Gly Ser Gly Gly Cys Ala Asn Gly Leu 
425 1430 1435 1440 

Thr Val Asp Tyr Leu Glu Lys Arg He Leu Trp He Asp Ala Arg Ser 

1445 1450 1455 

Asp Ala He Tyr Ser Ala Arg Tyr Asp Gly Ser Gly His Met Glu Val 

1460 1465 1470 

Leu Arg Gly His Glu Phe Leu Ser His Pro Phe Ala Val Thr Leu Tyr 

1475 1480 1485 

Gly Gly Glu Val Tyr Trp Thr Asp Trp Arg Thr Asn Thr Leu Ala Lys 

1490 1495 1500 

Ala Asn Lys Trp Thr Gly His Asn Val Thr Val Val Gin Arg Thr Asn 
505 1510 1515 1520 

Thr Gin Pro Phe Asp Leu Gin Val Tyr His Pro Ser Arg Gin Pro Met 

1525 1530 1535 

Ala Pro Asn Pro Cys Glu Ala Asn Gly Gly Arg Gly Pro Cys Ser His 

1540 1545 1550 

Leu Cys Leu He Asn Tyr Asn Arg Thr Val Ser Trp Ala Cys Pro His 

1555 1560 • 1565 

Leu Met Lys Leu His Lys Asp Asn Thr Thr Cys Tyr Glu Phe Lys Lys 

1570 1575 1580 

Phe Leu Leu Tyr Ala Arg Gin Met Glu He Arg Gly Val Asp Leu Asp 
585 1590 1595 1600 

Ala Pro Tyr Tyr Asn Tyr lie lie Ser Phe Thr Val Pro Asp He Asp 

1605 1610 1615 

Asn Val Thr Val Leu Asp Tyr Asp Ala Arg Glu Gin Arg Val Tyr Trp 

1620 1625 1630 

Ser Asp Val Arg Thr Gin Ala He Lys Arg Ala Phe He Asn Gly Thr 

1635 1640 1645 

Gly Val Glu Thr Val Val Ser Ala Asp Leu Pro Asn Ala His Gly Leu 
1650 1655 1660 
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Ala Val Asp Trp Val Ser Arg Asn Leu Phe Trp Thr Ser Tyr Asp Thr 
665 1670 1675 1680 

Asn Lys Lys Gin He Asn Val Ala Arg Leu Asp Gly Ser Phe Lys Asn 

1685 1690 1695 

Ala Val Val Gin Gly Leu Glu Gin Pro His Gly Leu Val Val His Pro 

1700 1705 1710 

Leu Arg Gly Lys Leu Tyr Trp Thr Asp Gly Asp Asn He Ser Met Ala 

1715 1720 1725 

Asn Met Asp Gly Ser Asn His Thr Leu Leu Phe Ser Gly Gin Lys Gly 

1730 1735 1740 

Pro Val Gly Leu Ala He Asp Phe Pro Glu Ser Lys Leu Tyr Trp He 
745 1750 1755 1760 

Ser Ser Gly Asn His Thr He Asn Arg Cys Asn Leu Asp Gly Ser Glu 

1765 1770 1775 

Leu Glu Val He Asp Thr Met Arg Ser Gin Leu Gly Lys Ala Thr Ala 

1780 1785 1790 

Leu Ala lie Met Gly Asp Lys Leu Trp Trp Ala Asp Gin Val Ser Glu 

1795 1800 1805 

Lys Met Gly Thr Cys Asn Lys Ala Asp Gly Ser Gly Ser Val Val Leu 

1810 1815 1820 

Arg Asn Ser Thr Thr Leu Val Met His Met Lys Val Tyr Asp Glu Ser 
825 1830 1835 1840 

He Gin Leu Glu His Glu Gly Thr Asn Pro Cys Ser Val Asn Asn Gly 

1845 1850 1855 

Asp Cys Ser Gin Leu Cys Leu Pro Thr Ser Glu Thr Thr Arg Ser Cys 

1860 1865 1870 

Met Cys Thr Ala Gly Tyr Ser Leu Arg Ser Gly Gin Gin Ala Cys Glu 

1875 1880 1885 

Gly Val Gly Ser Phe Leu Leu Tyr Ser Val His Glu Gly He Arg Gly 

1890 1895 1900 

He Pro Leu Asp Pro Asn Asp Lys Ser Asp Ala Leu Val Pro Val Ser 
905 1910 1915 1920 

Gly Thr Ser Leu Ala Val Gly He Asp Phe His Ala Glu Asn Asp Thr 

1925 1930 1935 

He Tyr Trp Val Asp Met Gly Leu Ser Thr lie Ser Arg Ala Lys Arg 

1940 1945 1950 

Asp Gin Thr Trp Arg Glu Asp Val Val Thr Asn Gly He Gly Arg Val 

1955 1960 1965 

Glu Gly He Ala Val Asp Trp He Ala Gly Asn He Tyr Trp Thr Asp 

1970 1975 1980 

Gin Gly Phe Asp Val He. Glu Val Ala Arg Leu Asn Gly Ser Phe Arg 
985 1990 1995 2000 
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Tyr Val Val He Ser Gin Gly Leu Asp Lys Pro Arg Ala He Thr Val 

200'5 2010 2015 

His Pro Glu Lys Gly Tyr Leu Phe Trp Thr Glu Trp Gly His Tyr Pro 

2020 . 2025 2030 

Arg He Glu Arg Ser Arg Leu Asp Gly Thr Glu Arg Val Val Leu Val 

2035 2040 2045 

Asn Val Ser He Ser Trp Pro Asn Gly He Ser Val Asp Tyr Gin Gly 

2050 2055 2060 

Gly Lys Leu Tyr Trp Cys Asp Ala Arg Met Asp Lys He Glu Arg He 
065 2070 2075 2080 

Asp Leu Glu Thr Gly Glu Asn Arg Glu Val Val Leu Ser Ser Asn Asn 

2085 2090 2095 

Met Asp Met Phe Ser Val Ser Val Phe Glu Asp Phe He Tyr Trp Ser 

2100 2105 2110 

Asp Arg Thr His Ala Asn Gly Ser He Lys Arg Gly Cys Lys Asp Asn 

2115 2120 2125 

Ala Thr Asp Ser Val Pro Leu Arg Thr Gly He Gly Val Gin Leu Lys 

2130 2135 2140 

Asp He Lys Val Phe Asn Arg Asp Arg Gin Lys Gly Thr Asn Val Cys 
145 2150 2155 2160 

Ala Val Ala Asn Gly Gly Cys Gin Gin Leu Cys Leu Tyr Arg Gly Gly 

2165 2170 2175 

Gly Gin Arg Ala Cys Ala Cys Ala His Gly Met Leu Ala Glu Asp Gly 

2180 , 2185 2190 

Ala Ser Cys Arg Glu Tyr Ala Gly Tyr Leu Leu Tyr Ser Glu Arg Thr 

2195 2200 2205 

He Leu Lys Ser He His Leu Ser Asp Glu Arg Asn Leu Asn Ala Pro 

2210 2215 2220 

Val Gin Pro Phe Glu Asp Pro Glu His Met Lys Asn Val He Ala Leu 
225 2230 2235 2240 

Ala Phe Asp Tyr Arg Ala Gly Thr Ser Pro Gly Thr Pro Asn Arg He 

2245 2250 2255 

Phe Phe Ser Asp He His Phe Gly Asn He Gin Gin He Asn Asp Asp 

2260 2265 2270 

Gly Ser Gly Arg Thr Thr He Val Glu Asn Val Gly Ser Val Glu Gly 

2275 2280 2285 

Leu Ala Tyr His Arg Gly Trp Asp Thr Leu Tyr Trp Thr Ser Tyr Thr 

2290 2295 2300 

Thr Ser Thr He Thr Arg His Thr Val Asp Gin Thr Arg Pro Gly Ala 
305 2310 2315 2320 

Phe Glu Arg Glu Thr Val He Thr Met Ser Gly Asp Asp His Pro Arg 
2325. 2330 2335 
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Ala Phe Val Leu Asp Glu Cys Gin Asn Leu Met Phe Trp Thr Asn Trp 

2340 . 2345 1 2350 

Asn Glu Leu His Pro Ser He Met Arg Ala Ala Leu Ser Gly Ala Asn 

2355 2360 2365 

Val Leu Thr Leu lie Glu Lys Asp He Arg Thr Pro Asn Gly Leu Ala 

2370 2375 2380 

He Asp His Arg Ala Glu Lys Leu Tyr Phe Ser Asp Ala Thr Leu Asp 
385 2390 2395 2400 

Lys He Glu Arg Cys Glu Tyr Asp Gly Ser His Arg Tyr Val He Leu 

2405 2410 2415 

Lys Ser Glu Pro Val His Pro Phe Gly Leu Ala Val Tyr Gly Glu His 

2420 2425 2430 

He Phe Trp Thr Asp Trp Val Arg Arg Ala Val Gin Arg Ala Asn Lys 

2435 2440 2445 

Tyr Val Gly Ser Asp Met Lys Leu Leu Arg Val Asp He Pro Gin Gin 

2450 2455 2460 

Pro Met Gly He He Ala Val Ala Asn Asp Thr Asn Ser Cys Glu Leu 
465 2470 2475 2480 

Ser Pro Cys Arg He Asn Asn Gly Gly Cys Gin Asp Leu Cys Leu Leu 

2485 2490 2495 

Thr His Gin Gly His Val Asn Cys Ser Cys Arg Gly Gly Arg He Leu 

2500 2505 2510 

Gin Glu Asp Phe Thr Cys Arg Ala Val Asn Ser Ser Cys Arg Ala Gin 

2515 2520 2525 

Asp Glu Phe Glu Cys Ala Asn Gly Glu Cys He Ser Phe Ser Leu Thr 

2530 2535 2540 

Cys Asp Gly Val Ser His Cys Lys Asp Lys Ser Asp Glu Lys Pro Ser 
545 2550 2555 2560 

Tyr Cys Asn Ser Arg Arg Cys Lys Lys Thr Phe Arg Gin Cys Asn Asn 

2565 2570 2575 

Gly Arg Cys Val Ser Asn Met Leu Trp Cys Asn Gly Val Asp Tyr Cys 

2580 2585 2590 

Gly Asp Gly Ser Asp Glu He Pro Cys Asn Lys Thr Ala Cys Gly Val 

2595 2600 2605 

Gly Glu Phe Arg Cys Arg Asp Gly Ser Cys He Gly Asn Ser Ser Arg 

2610 2615 2620 

Cys Asn Gin Phe Val Asp Cys Glu Asp Ala Ser Asp Glu Met Asn Cys 
625 2630 2635 2640 

Ser Ala Thr Asp Cys Ser Ser Tyr Phe Arg Leu Gly Val Lys Gly Val 

2645 2650 2655 

Leu Phe Gin Pro Cys Glu Arg Thr Ser Leu Cys Tyr Ala Pro Ser Trp 



2660 



2665 



2670 
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Val Cys Asp Gly Ala Asn Asp Cys Sly Asp Tyr Ser Asp Glu Arg Asp 

2675 2680 : 2685 

Cys Pro Gly Val Lys Arg Pro Arg Cys Pro Leu Asn Tyr Phe Ala Cys 

2690 2695 2700 

Pro Ser Gly Arg Cys He Pro Met Ser Trp Thr Cys Asp Lys Glu Asp 
705 2710 2715 2720 

Asp Cys Glu Asn Gly Glu Asp Glu Thr His Cys Asn Lys Phe Cys Ser 

2725 2730 2735 

Glu Ala Gin Phe Glu Cys Gin Asn His Arg Cys He Ser Lys Gin Trp 

2740 2745 2750 

Leu Cys Asp Gly Ser Asp Asp Cys Gly Asp Gly Ser Asp Glu Ala Ala 

2755 2760 2765 

His Cys Glu Gly Lys Thr Cys Gly Pro Ser Ser Phe Ser Cys Pro Gly 

2770 2775 2780 

Thr His Val Cys Val Pro Glu Arg Trp Leu Cys Asp Gly Asp Lys Asp 
785 2790 2795 2800 

Cys Thr Asp Gly Ala Asp Glu Ser Val Thr Ala Gly Cys Leu Tyr Asn 

2805 2810 2815 

Ser Thr Cys Asp Asp Arg Glu Phe Met Cys Gin Asn Arg Leu Cys lie 

2820 2825 2830 

Pro Lys His Phe Val Cys Asp His Asp Arg Asp Cys Ala Asp Gly Ser 

2835 2840 2845 

Asp Glu Ser Pro Glu Cys Glu Tyr Pro Thr Cys Gly Pro Asn Glu Phe 

2850 2855 2860 

Arg Cys Ala Asn Gly Arg Cys Leu Ser Ser Arg Gin Trp Glu Cys Asp 
865 2870 2875 2880 

Gly Glu Asn Asp Cys His Asp His Ser Asp Glu Ala Pro Lys Asn Pro 

2885 2890 2895 

His Cys Thr Ser Pro Glu His Lys Cys Asn Ala Ser Ser Gin Phe Leu 

2900 2905 2910 

Cys Ser Ser Gly Arg Cys Val Ala Glu Ala Leu Leu Cys Asn Gly Gin 

2915 2920 2925 

Asp Asp Cys Gly Asp Gly Ser Asp Glu Arg Gly Cys His Val Asn Glu . 

2930 2935 2940 

Cys Leu Ser Arg Lys Leu Ser Gly Cys Ser Gin Asp Cys Glu Asp Leu 
945 2950 2955 2960 

Lys lie Gly Phe Lys Cys Arg Cys Arg Pro Gly Phe Arg Leu Lys Asp 

2965 2970 2975 

Asp Gly Arg Thr Cys Ala Asp Leu Asp Glu Cys Ser Thr Thr Phe Pro 

2980 2985 2990 . 

Cys Ser Gin Leu Cys He Asn Thr His Gly Ser Tyr Lys Cys Leu Cys 
2995 3000 3005 
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Val Glu Gly Tyr Ala Pro Arg Gly Gly Asp Pro His Ser Cys Lys Ala 

3010 3015 3020 

Val Thr Asp Glu Glu Pro Phe Leu He Phe Ala Asn Arg Tyr Tyr Leu 
025 3030 3035 3040 

Arg Lys Leu Ash Leu Asp Gly Ser Asn Tyr Thr Leu Leu Lys Gin Gly 

3045 3050 3055 

Leu Asn Asn Ala Val Ala Leu Ala Phe Asp. Tyr Arg Glu Gin Met He 

3060 3065 3070 

Tyr Trp Thr Gly Val Thr Thr Gin Gly Ser Met He Arg Arg Met His 

3075 3080 3085 

Leu Asn Gly Ser Asn Val Gin Val Leu His Arg Thr Gly Leu Ser Asn 

3090 3095 3100 

Pro Asp Gly Leu Ala Val Asp Trp Val Gly Gly Asn Leu Tyr Trp Cys 
105 ' 3110 3115 3120 

Asp Lys Gly Arg Asp Thr He Glu Val Ser Lys Leu Asn Gly Ala Tyr 

3125 3130 3135 

Arg Thr Val Leu Val Ser Ser Gly Leu Arg Glu Pro Arg Ala Leu Val 

3140 3145 3150 

Val Asp Val Gin Asn Gly Tyr Leu Tyr Trp Thr Asp Trp Gly Asp His 

3155 3160 3165 

Ser Leu He Gly Arg He Gly Met Asp Gly Ser Gly Arg Ser He He 

3170 3175 3180 

Val Asp Thr Lys He Thr Trp Pro Asn Gly Leu Thr Val Asp Tyr Val 
185 3190 3195 3200 

Thr Glu Arg He Tyr Trp Ala Asp Ala Arg Glu Asp Tyr lie Glu Phe 

3205 3210 3215 

Ala Ser Leu Asp Gly Ser Asn Arg His Val Val Leu Ser Gin Asp He 

3220 3225 3230 

Pro His He Phe Ala Leu Thr Leu Phe Glu Asp Tyr Val Tyr Trp Thr 

3235 3240 3245 

Asp Trp Glu Thr Lys Ser He Asn Arg Ala His Lys Thr Thr Gly Ala 

3250 3255 3260 

Asn Lys Thr Leu Leu He Ser Thr Leu His Arg Pro Met Asp Leu His 
265 3270 3275 3280 

Val Phe His Ala Leu Arg Gin Pro Asp Val Pro Asn His Pro Cys Lys 

3285 3290 3295 

Val Asn Asn Gly Gly Cys Ser Asn Leu Cys Leu Leu Ser Pro Gly Gly 

3300 3305 3310 

Gly His Lys Cys Ala Cys Pro Thr Asn Phe Tyr Leu Gly Gly Asp Gly 

3315 3320 3325 

Arg Thr Cys Val Ser Asn Cys Thr Ala Ser Gin Phe Val Cys Lys Asn 
3330 3335 3340 
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Asp Lys Cys He Pro Phe Trp Trp Lys Cys Asp Thr Glu Asp Asp Cys 
345 3350 3355 3360 

Gly Asp His Ser Asp Glu Pro Pro Asp Cys Pro Glu Phe Lys Cys Arg 

3365 3370 3375 

Pro Gly Gin Phe Gin Cys Ser Thr Gly He Cys Thr Asn Pro Ala Phe 

3380 3385 3390 

He Cys Asp Gly Asp Asn Asp Cys Gin Asp Asn Ser Asp Glu Ala Asn 

3395 3400 3405 

Cys Asp He His Val Cys Leu Pro Ser Gin Phe Lys Cys Thr Asn Thr 

3410 3415 3420 

Asn Arg Cys He Pro Gly lie Phe Arg Cys Asn Gly Gin Asp Asn Cys 
425 3430 3435 3440 

Gly Asp Gly Glu Asp Glu Arg Asp Cys Pro Glu Val Thr Cys Ala Pro 

3445 3450 3455 

Asn Gin Phe Gin Cys Ser He Thr Lys Arg Cys He Pro Arg Val Trp 

3460 3465 3470 

Val Cys Asp Arg Asp Asn His Cys Val Asp Gly Ser Asp Glu Pro Ala 

3475 3480 3485 

Asn Cys Thr Gin Met Thr Cys Gly Val Asp Glu Phe Arg Cys Lys Asp 

3490 3495 3500 

Ser Gly Arg Cys He Pro Ala Arg Trp Lys Cys Asp Gly Glu Asp Asp 
505 3510 3515 3520 

Cys Gly Asp Gly Ser Asp Glu Pro Lys Glu Glu Cys Asp Glu Arg Thr 

3525 3530 3535 

Cys Glu Pro Tyr Gin Phe Arg Cys Lys Asn Asn Arg Cys Val Pro Gly 

3540 3545 3550 

Arg Trp Gin Cys Asp Tyr Asp Asn Asp Cys Gly Asp Asn Ser Asp Glu 

3555 3560 3565 

Glu Ser Cys Thr Pro Arg Pro Cys Ser Glu Ser Glu Phe Phe Cys Ala 

3570 3575 3580 

Asn Gly Arg Cys He Ala Gly Arg Trp Lys Cys Asp Gly Asp His Asp 
585 3590 3595 3600 

Cys Ala Asp Gly Ser Asp Glu Lys Asp Cys Thr Pro Arg Cys Asp Met 

3605 3610 3615 

Asp Gin Phe Gin Cys Lys Ser Gly His Cys He Pro Leu Arg Trp Pro 

3620 3625 3630 

Cys Asp Ala Asp Ala Asp Cys Met Asp Gly Ser Asp Glu Glu Ala Cys 

3635 3640 3645 

Gly Thr Gly Val Arg Thr Cys Pro Leu Asp Glu Phe Gin Cys Asn Asn 

3650 3655 3660 

Thr Leu Cys Lys Pro Leu Ala Trp Lys Cys Asp Gly Glu Asp Asp Cys 
665 3670 3675 3680 
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Gly Asp Asn Ser Asp Glu Asn Pro Glu Glu Cys Ala Arg Phe lie Cys 

3685 3690 3695 

Pro Pro Asn Arg Pro Phe Arg Cys Lys Asn Asp Arg Val Cys Leu Trp 

3700 3705 3710 

He Gly Arg Gin Cys Asp Gly Val Asp Asn Cys Gly Asp Gly Thr Asp 

3715 3720 3725 

Glu Glu Asp Cys Glu Pro Pro Thr Ala Gin Asn Pro His Cys Lys Asp 

3730 3735 3740 

Lys Lys Glu Phe Leu Cys Arg Asn Gin Arg Cys Leu Ser Ser Ser Leu 
745 3750 3755 3760 

Arg Cys Asn Met Phe Asp Asp Cys Gly Asp Gly Ser Asp Glu Glu Asp 

3765 3770 3775 

Cys Ser He Asp Pro Lys Leu Thr Ser Cys Ala Thr Asn Ala Ser Met 

3780 3785 3790 

Cys Gly Asp Glu Ala Arg Cys Val Arg Thr Glu Lys Ala Ala Tyr Cys 

3795 3800 3805 

Ala Cys Arg Ser Gly Phe His Thr Val Pro Gly Gin Pro Gly Cys Gin 

3810 3815 3820 

Asp He Asn Glu Cys Leu Arg Phe Gly Thr Cys Ser Gin Leu Trp Asn 
82S 3830 3835 3840 

Lys Pro Lys Gly Gly His Leu Cys Ser Cys Ala Arg Asn Phe Met Lys 

3845 3850 3855 

Thr His Asn Thr Cys Lys Ala Glu Gly Ser GTu Tyr Gin Val Leu Tyr 

3860 3865 3870 

He Ala Asp Asp Asn Glu He Arg Ser Leu Phe Pro Gly His Pro His 

3875 3880 3885 

Ser Ala Tyr Glu Gin Thr Phe Gin Gly Asp Glu Ser Val Arg He Asp 

3890 3895 3900 

Ala Met Asp Val His Val Lys Ala Gly Arg Val Tyr Trp Thr Asn Trp 
905 3910 3915 3920 

His Thr Gly Thr He Ser Tyr Arg Ser Leu Pro Pro Ala Ala Pro Pro 

3925 3930 3935 . 

Thr Thr Ser Asn Arg His Arg Arg Gin He Asp Arg Gly Val Thr His 

3940 3945 3950 

Leu Asn He Ser Gly Leu Lys Met Pro Arg Gly lie Ala He Asp Trp 

3955 3960 3965 

Val Ala Gly Asn Val Tyr Trp Thr Asp Ser Gly Arg Asp Val He Glu 

3970 3975 3980 

Val Ala Gin. Met Lys Gly Glu Asn Arg Lys Thr Leu He Ser Gly Met 
985 3990 3995 4000 

He Asp Glu Pro His Ala He Val Val Asp Pro Leu Arg Gly Thr Met 
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Tyr Trp Ser Asp Trp Gly Asn His Pro Lys lie Glu Thr Ala Ala Met 

4020 4025 4030 

Asp Gly Thr Leu Arg Glu Thr Leu Val Gin Asp Asn lie Gin Trp Pro 

4035 4040 4045 

Thr Gly Leu Ala Val Asp Tyr His Asn Glu Arg Leu Tyr Trp Ala Asp 

4050 4055 4060 

Ala Lys Leu Ser Val He Gly Ser He Arg Leu Asn Gly Thr Asp Pro 
065 4070 4075 4080 

He Val Ala Ala Asp Ser Lys Arg Gly Leu Ser His Pro Phe Ser He 

4085 4090 4095 

Asp Val Phe Glu Asp Tyr He Tyr Gly Val Thr Tyr He Asn Asn Arg 

4100 '4105 4110 

Val Phe Lys He His Lys Phe Gly His Ser Pro Leu Tyr Asn Leu Thr 

4115 4120 4125 

Gly Gly Leu Ser His Ala Ser Asp Val Val Leu Tyr His Gin His Lys 

4130 4135 4140 

Gin Pro Glu Val Thr Asn Pro Cys Asp Arg Lys Lys Cys Glu Trp Leu 
145 4150 4155 4160 

Cys Leu Leu Ser Pro Ser Gly Pro Val Cys Thr Cys Pro Asn Gly Lys 

4165. 4170 4175 

Arg Leu Asp Asn Gly Thr Cys Val Pro Val Pro Ser Pro Thr Pro Pro 

4180 4185 4190 

Pro Asp Ala Pro Arg Pro Gly Thr Cys Thr Leu Gin Cys Phe Asn Gly 

4195 4200 4205 

Gly Ser Cys Phe Leu Asn Ala Arg Arg Gin Pro Lys Cys Arg Cys Gin 

4210 4215 4220 

Pro Arg Tyr Thr Gly Asp Lys Cys Glu Leu Asp Gin Cys Trp Glu Tyr 
225 4230 4235 4240 

Cys His Asn Gly Gly Thr Cys Ala Ala Ser Pro Ser Gly Met Pro Thr 

4245 4250 4255 

Cys Arg Cys Pro Thr Gly Phe Thr Gly Pro Lys Cys Thr Ala Gin Val 

4260 4265 4270 

Cys Ala Gly Tyr Cys Ser Asn Asn Ser Thr Cys Thr Val Asn Gin Gly 

4275 4280 4285 

Asn Gin Pro Gin Cys Arg Cys Leu Pro Gly Phe Leu Gly Asp Arg Cys 

4290 4295 4300 

Gin Tyr Arg Gin Cys Ser Gly Phe Cys Glu Asn Phe Gly Thr Cys Gin 
305 4310 4315 4320 

Met Ala Ala Asp Gly Ser Arg Gin Cys Arg Cys Thr Val Tyr Phe Glu 

4325 4330 4335 

Gly Pro Arg Cys Glu Val Asn Lys Cys Ser Arg Cys Leu Gin Gly Ala 
4340 4345 4350 
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Cys Val Val Asn Lys Gin Thr Gly Asp Val Thr Cys Asn Cys Thr Asp 

4355 4360 4365 

Gly Arg Val Ala Pro Ser Cys Leu Thr Cys He Asp His Cys Ser Asn 

4370 4375 4380 

Gly Gly Ser Cys Thr Met Asn Ser Lys Met Met Pro Glu Cys Gin Cys 
385 4390 4395 4400 

Pro Pro His Met Thr Gly Pro Arg Cys Gin Glu Gin Val Val Ser Gin 

4405 4410 4415 

Gin Gin' Pro Gly His Met Ala Ser He Leu lie Pro Leu Leu Leu Leu 

4420 442S 4430 

Leu Leu Leu Leu Leu Val Ala Gly Val Val Phe Trp Tyr Lys Arg Arg 

4435 4440 4445 

Val Arg Gly Ala Lys Gly Phe Gin His Gin Arg Met Thr Asn Gly Ala 

4450 4455 4460 

Met Asn Val Glu He Gly Asn Pro Thr Tyr Lys Met Tyr Glu Gly Gly 
465 4470 4475 4480 

Glu Pro Asp Asp Val Gly Gly Leu Leu Asp Ala Asp Phe Ala Leu Asp 

4485 4490 4495 

Pro Asp Lys Pro Thr Asn Phe Thr Asn Pro Val Tyr Ala Thr Leu Tyr 

4500 4505 4510 

Met Gly Gly His Gly Ser Arg His Ser Leu Ala Ser Thr Asp Glu Lys 

4515 4520 4525 

Arg Glu Leu Leu Gly Arg Gly Pro Glu Asp Glu lie Gly Asp Pro Leu 
4530 4535 4540 

Ala 
545 
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GCTAGMTCC ATCTGGTCTC CTCCAGCTCC TTCTTTCTGC AAC ATG GGG AAG AAC 55 

Met Gly Lys Asn 
1 

AAA CTC CTT CAT CCA AGT CTG GTT CTT CTC CTC TTG GTC CTC CTG CCC 103 
Lys Leu Leu His Pro Ser Leu Val Leu Leu Leu Leu Val Leu Leu Pro 
5 10 15 20 

ACA GAC GCC TCA GTC TCT GGA AAA CCG CAG TAT ATG GTT CTG GTC CCC 151 
Thr Asp Ala Ser Val Ser Gly Lys Pro Gin Tyr Met Val Leu Val Pro 
25 30 35 

TCC CTG CTC CAC ACT GAG ACC ACT GAG AAG GGC TGT GTC CTT CTG AGC 199 
Ser Leu Leu His Thr Glu Thr Thr Glu Lys Gly Cys Val Leu Leu Ser 
40 45 50 

TAC CTG AAT GAG ACA GTG ACT GTA AGT GCT TCC TTG GAG TCT GTC AGG 247 
Tyr Leu Asn Glu Thr Val Thr Val Ser Ala Ser Leu Glu Ser Val Arg 
55 60 65 

GGA AAC AGG AGC CTC TTC ACT GAC CTG GAG GCG GAG AAT GAC GTA CTC 295 
Gly Asn Arg Ser Leu Phe Thr Asp Leu Glu Ala Glu Asn Asp Val Leu 

70 75 r . 80 

CAC TGT GTC GCC TTC GCT GTC CCA AAG TCT TCA TCC AAT GAG GAG GTA 343 
His Cys Val Ala Phe Ala Val Pro Lys Ser Ser Ser Asn Glu Glu Val 
85 90 95 100 

ATG TTC CTC ACT GTC CAA GTG AAA GGA CCA ACC CAA GAA TTT AAG AAG 391 
Met Phe Leu Thr Val Gin Val Lys Gly Pro Thr Gin Glu Phe Lys Lys 
105 ■ 110 115 

CGG ACC ACA GTG ATG GTT AAG AAC GAG GAC AGT CTG GTC TTT GTC CAG 439 
Arg Thr Thr Val Met Val Lys Asn Glu Asp Ser Leu Val Phe Val Gin 
120 125 130 

ACA GAC AAA TCA ATC TAC AAA CCA GGG CAG ACA GTG AAA TTT CGT GTT 487 
Thr Asp Lys Ser He Tyr Lys Pro Gly Gin Thr Val Lys Phe Arg Val 
135 140 145 

GTC TCC ATG GAT GAA AAC TTT CAC CCC CTG AAT GAG TTG ATT CCA CTA 535 
Val Ser Met Asp Glu Asn Phe His Pro Leu Asn Glu Leu lie Pro Leu 
150 155 160 
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GTA TAC ATT CAG GAT CCC AAA GGA AAT CGC ATC GCA CAA TGG CAG AGT 583 
Val Tyr He Gin Asp Pro Lys Gly Asn Arg lie Ala Gin Trp Gin Ser 
165 170 175 180 

TTC CAG TTA GAG GGT GGC CTC AAG CAA TTT TCT TTT CCC CTC TCA TCA 631 
Phe Gin Leu Glu Gly Gly Leu Lys Gin Phe Ser Phe Pro Leu Ser Ser 
185 190 195 

GAG CCC TTC CAG GGC TCC TAC AAG GTG GTG GTA CAG AAG AAA TCA GGT 679 
Glu Pro Phe Gin Gly Ser Tyr Lys Val Val Val Gin Lys Lys Ser Gly 
200 205 210 

GGA AGG ACA GAG CAC CCT TTC ACC GTG GAG GAA TTT GTT CTT CCC AAG 727 
Gly Arg Thr Glu His Pro Phe Thr Val Glu Glu Phe Val Leu Pro Lys 
215 220 225 

TTT GAA GTA CAA GTA ACA GTG CCA AAG ATA ATC ACC ATC TTG GAA GAA 775 
Phe Glu Val Gin Val Thr Val Pro Lys lie lie Thr He Leu Glu Glu 
230 235 240 

GAG ATG AAT GTA TCA GTG TGT GGC CTA TAC ACA TAT GGG AAG CCT GTC 823 
Glu Met Asn Val Ser Val Cys Gly Leu Tyr Thr Tyr, Gly Lys Pro Val 
245 250 255 260 

CCT GGA CAT GTG ACT GTG AGC ATT TGC AGA AAG TAT AGT GAC GCT TCC 871 
Pro Gly His Val Thr Val Ser He Cys Arg Lys Tyr Ser Asp Ala Ser 
265 270 275 

GAC TGC CAC GGT GAA GAT TCA CAG GCT TTC TGT GAG AAA TTC AGT GGA 919 
Asp Cys His Gly Glu Asp Ser Gin Ala Phe Cys Glu Lys Phe Ser Gly 
280 285 290 

CAG CTA AAC AGC CAT GGC TGC TTC TAT CAG CAA GTA AAA ACC AAG GTC 967 
Gin Leu Asn Ser His Gly Cys Phe Tyr Gin Gin Val Lys Thr Lys Val 
295 300 305 

TTC CAG CTG AAG AGG AAG GAG TAT GAA ATG AAA CTT CAC ACT GAG GCC 1015 
Phe Gin Leu Lys Arg Lys Glu Tyr Glu Met Lys Leu His Thr Glu Ala 
310 315 320 

CAG ATC CAA GAA GAA GGA ACA GTG GTG GAA TTG ACT GGA AGG CAG TCC 1063 
Gin He Gin Glu Glu Gly Thr Val Val Glu Leu Thr Gly Arg Gin Ser 
325 330 335 340 
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AGT GAA ATC ACA AGA ACC ATA ACC AAA CTC TCA TTT GTG AAA GTG GAC 1111 
Ser Glu He Thr Arg Thr He Thr Lys Leu Ser Phe Val Lys Val Asp 
345 350 355 

TCA CAC TTT CGA CAG GGA ATT CCC TTC TTT GGG CAG GTG CGC CTA GTA 1159 
Ser His Phe Arg Gin Gly He Pro Phe Phe Gly Gin Val Arg Leu Val 
360 365 370 

GAT GGG AAA GGC GTC CCT ATA CCA AAT AAA GTC ATA TTC ATC AGA GGA 1207 
Asp Gly Lys Gly Val Pro He Pro Asn Lys Val He Phe He Arg Gly 
375 380 385 

AAT GAA GCA AAC TAT TAC TCC AAT GCT ACC ACG GAT GAG CAT GGC CTT 1255 
Asn Glu Ala Asn Tyr Tyr Ser Asn Ala Thr Thr Asp Glu His Gly Leu 
390 395 400 

GTA CAG TTC TCT ATC AAC ACC ACC AAC GTT ATG GGT ACC TCT CTT ACT 1303 
Val Gin Phe Ser He Asn Thr Thr Asn Val Met Gly Thr Ser Leu Thr 
405 410 415 420 

GTT AGG GTC AAT TAC AAG GAT CGT AGT CCC TGT TAC GGC TAC CAG TGG 1351 
Val Arg Val Asn Tyr Lys Asp Arg Ser Pro Cys Tyr Gly Tyr Gin Trp 
425 430 435 

GTG TCA GAA GAA CAC GAA GAG GCA CAT CAC ACT GCT TAT CTT GTG TTC 1399 
Val Ser Glu Glu His Glu Glu Ala His His Thr Ala Tyr Leu Val Phe 
440 445 450 

TCC CCA AGC AAG AGC TTT GTC CAC CTT GAG CCC ATG TCT CAT GAA CTA 1447 
Ser Pro Ser Lys Ser Phe Val His Leu Glu Pro Met Ser His Glu Leu 
455 460 465 

CCC TGT GGC CAT ACT CAG ACA GTC CAG GCA CAT TAT ATT CTG AAT GGA 1495 
Pro Cys Gly His Thr Gin Thr Val Gin Ala His Tyr He Leu Asn Gly 
470 475 480 

GGC ACC CTG CTG GGG CTG AAG AAG CTC TCC TTT TAT TAT CTG ATA ATG 1543 
Gly Thr Leu Leu Gly Leu Lys Lys Leu Ser Phe Tyr Tyr Leu lie Met 
485 490 495 500 

GCA AAG GGA GGC ATT GTC CGA ACT GGG ACT CAT GGA CTG CTT GTG AAG 1591 
Ala Lys Gly Gly He Val Arg Thr Gly Thr His Gly Leu Leu Val Lys 
505 510 515 
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CAG GAA GAC ATG AAG GGC CAT TTT TCC ATC TCA ATC CCT GTG AAG TCA 1639 
Gin Glu Asp Met Lys Gly His Phe Ser He Ser He Pro Val Lys Ser 
520 525 530 

GAC ATT GCT CCT GTC GCT CGG TTG CTC ATC TAT GCT GTT TTA CCT ACC 1687 
Asp lie Ala Pro Val Ala Arg Leu Leu He Tyr Ala Val Leu Pro Thr 
535 540 545 

GGG GAC GTG ATT GGG GAT TCT GCA AAA TAT GAT GTT GAA AAT TGT CTG 1735 
Gly Asp Val He Gly Asp Ser Ala Lys Tyr Asp Val Glu Asn Cys Leu 
550 555 560 

GCC AAC AAG GTG GAT TTG AGC TTC AGC CCA TCA CAA AGT CTC CCA GCC 1783 
Ala Asn Lys Val Asp Leu Ser Phe Ser Pro Ser Gin Ser Leu Pro Ala 
565 570 575 580 . 

TCA CAC GCC CAC CTG CGA GTC ACA GCG GCT CCT CAG TCC GTC TGC GCC 1831 
Ser His Ala His Leu Arg Val Thr Ala Ala Pro Gin Ser Val Cys Ala 
585 590 595 

CTC CGT GCT GTG GAC CAA AGC GTG CTG CTC ATG AAG CCT GAT GCT GAG 1879 
Leu Arg Ala Val Asp Gin Ser Val Leu Leu Met Lys Pro Asp Ala Glu 
600 605 610 

CTC TCG GCG TCC TCG GTT TAC AAC CTG CTA CCA GAA AAG GAC CTC ACT 1927 
Leu Ser Ala Ser Ser Val Tyr Asn Leu Leu Pro Glu Lys Asp Leu Thr 
615 620 625 

GGC TTC CCT GGG CCT TTG AAT GAC CAG GAC GAT GAA GAC TGC ATC AAT 1975 
Gly Phe Pro Gly Pro Leu Asn Asp Gin Asp Asp Glu Asp Cys He Asn 
630 635 640 

CGT CAT AAT GTC TAT ATT AAT GGA ATC ACA TAT ACT CCA GTA TCA AGT 2023 
Arg His Asn Val Tyr He Asn Gly He Thr Tyr Thr Pro Val Ser Ser 
645 650 655 660 

ACA AAT GAA AAG GAT ATG TAC AGC TTC CTA GAG GAC ATG GGC TTA AAG 2071 
Thr Asn Glu Lys Asp Met Tyr Ser Phe Leu Glu Asp Met Gly Leu Lys 
665 670 675 

GCA TTC ACC AAC TCA AAG ATT CGT AAA CCC AAA ATG TGT CCA CAG CTT 2119 
Ala Phe Thr Asn Ser Lys He Arg Lys Pro Lys Met Cys Pro Gin Leu 
680 685 690 
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OAA CAG TAT GAA ATG CAT GGA CCT GAA GGT CTA CGT GTA GGT TTT TAT 2167 
Gin Gin Tyr Glu Met His Gly Pro Glu Gly Leu Arg Val Gly Phe Tyr 
695 700 705 

GAG TCA GAT GTA ATG GGA AGA GGC CAT GCA CGC CTG GTG CAT GTT GAA 2215 
Glu Ser Asp Val Met Gly Arg Gly His Ala Arg Leu Val His Val Glu 
710 715 720 

GAG CCT CAC ACG GAG ACC GTA CGA AAG TAC TTC CCT GAG ACA TGG ATC 2263 
Glu Pro His Thr Glu Thr Val Arg Lys Tyr Phe Pro Glu Thr Trp He 
725 730 735 740 

TGG GAT TTG GTG GTG GTA AAC TCA GCA GGG GTG GCT GAG GTA GGA GTA 2311 
Trp Asp Leu Val Val Val Asn Ser Ala Gly Val Ala Glu Val Gly Val 
745 750 755 

ACA GTC CCT GAC ACC ATC ACC GAG TGG AAG GCA GGG GCC TTC TGC CTG 2359 
Thr Val Pro Asp Thr lie Thr Glu Trp Lys Ala Gly Ala Phe Cys Leu 
760 765 770 

TCT GAA GAT GCT GGA CTT GGT ATC TCT TCC ACT GCC TCT CTC CGA GCC 2407 
Ser Glu Asp Ala Gly Leu Gly He Ser Ser Thr Ala Ser Leu Arg Ala 
775 780 785 

TTC CAG CCC TTC TTT GTG GAG CTT ACA ATG CCT TAC TCT GTG ATT CGT 2455 
Phe Gin Pro Phe Phe Val Glu Leu Thr Met Pro Tyr Ser Val He Arg 
790 795 800 

GGA GAG GCC TTC ACA CTC AAG GCC ACG GTC CTA AAC TAC CTT CCC AAA 2503 
Gly Glu Ala Phe Thr Leu Lys Ala Thr Val Leu Asn Tyr Leu Pro Lys 
805 810 815 820 

TGC ATC CGG GTC AGT GTG CAG CTG GAA GCC TCT CCC GCC TTC CTT GCT 2551 
Cys He Arg Val Ser Val Gin Leu Glu Ala Ser Pro Ala Phe Leu Ala 
825 830 835 

GTC CCA GTG GAG AAG GAA CAA GCG CCT CAC TGC ATC TGT GCA AAC GGG 2599 
Val Pro Val Glu Lys Glu Gin Ala Pro His Cys He Cys Ala Asn Gly 
840 845 850 

CGG CAA ACT GTG TCC TGG GCA GTA ACC CCA AAG TCA TTA GGA AAT GTG 2647 
Arg Gin Thr Val Ser Trp Ala Val Thr Pro Lys Ser Leu Gly Asn Val 
855 860 865 
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AAT TTC ACT GTG AGC GCA GAG GCA CTA GAG TCT CAA GAG CTG TGT GGG 2695 
Asn Phe Thr Val Ser Ala Glu Ala Leu Glu Ser Gin Glu Leu Cys Gly 
870 875 880 

ACT GAG GTG CCT TCA GTT CCT GAA CAC GGA AGG AAA GAC ACA GTC ATC 2743 
Thr Glu Val Pro Ser Val Pro Glu His Gly Arg Lys Asp Thr Val lie 
885 890 895 900 

AAG CCT CTG TTG GTT GAA CCT GAA GGA CTA GAG AAG GAA ACA ACA TTC 2791 
Lys Pro Leu Leu Val Glu Pro Glu Gly Leu Glu Lys Glu Thr Thr Phe 
905 910 915 

I 

AAC TCC CTA CTT TGT CCA TCA GGT GGT GAG GTT TCT GAA GAA TTA TCC 2839 
Asn Ser Leu Leu Cys Pro Ser Gly Gly Glu Val Ser Glu Glu Leu Ser 
920 925 930 

f 

CTG AAA CTG CCA CCA AAT GTG GTA GAA GAA TCT GCC CGA GCT TCT GTC 2887 
Leu Lys Leu Pro Pro Asn Val Val Glu Glu Ser Ala Arg Ala Ser Val 
935 940 945 

TCA GTT TTG GGA GAC ATA TTA GGC TCT GCC ATG CAA AAC ACA CAA AAT 2935 
Ser Val Leu Gly Asp lie Leu Gly Ser Ala Met Gin Asn Thr Gin Asn 
950 955 960 

CTT CTC CAG ATG CCC TAT GGC TGT GGA GAG CAG AAT ATG GTC CTC TTT 2983 
Leu Leu Gin Met Pro Tyr Gly Cys Gly Glu Gin Asn Met Val Leu Phe 
965 970 975 980 

GCT CCT AAC ATC TAT GTA CTG GAT TAT CTA AAT GAA ACA CAG CAG CTT 3031 
Ala Pro Asn He Tyr Val Leu Asp Tyr Leu Asn Glu Thr Gin Gin Leu 
985 990 995 

ACT CCA GAG GTC AAG TCC AAG GCC ATT GGC TAT CTC AAC ACT GGT TAC 3079 
Thr Pro Glu Val Lys Ser Lys Ala He Gly Tyr Leu Asn Thr Gly Tyr 
1000 1005 1010 

CAG AGA CAG TTG AAC TAC AAA CAC TAT GAT GGC TCC TAC AGC ACC TTT 3127 
Gin Arg Gin Leu Asn Tyr Lys His Tyr Asp Gly Ser Tyr Ser Thr Phe 

1020 ' 1025 



GGG GAG CGA TAT GGC AGG AAC CAG GGC AAC ACC TGG CTC ACA GCC' TTT 3175 
Gly Glu Arg Tyr Gly Arg Asn Gin Gly Asn Thr Trp Leu Thr Ala Phe 
1030 1035 1040 
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GTT CTG AAG ACT TTT GCC CM GCT CGA GCC TAC ATC TTC ATC GAT GAA 3223 
Val Leu Lys Thr Phe Ala Gin Ala Arg Ala Tyr He Phe He Asp Glu 
1045 1050 1055 1060 

GCA CAC ATT ACC CAA GCC CTC ATA TGG CTC TCC CAG AGG CAG AAG GAC 3271 
Ala His He Thr Gin Ala Leu He Trp Leu Ser Gin Arg Gin Lys Asp 
1065 1070 1075 

AAT GGC TGT TTC AGG AGC TCT GGG TCA CTG CTC AAC AAT GCC ATA AAG 3319 
Asn Gly Cys Phe Arg Ser Ser Gly Ser Leu Leu Asn Asn Ala He Lys 
1080 1085 1090 

GGA GGA GTA GAA GAT GAA GTG ACC CTC TCC GCC TAT ATC ACC ATC GCC 3367 
Gly Gly Val Glu Asp Glu Val Thr Leu Ser Ala Tyr He Thr He Ala 
1095 1100 1105 

CTT CTG GAG ATT CCT CTC ACA GTC ACT CAC CCT GTT GTC CGC AAT GCC 3415 
Leu Leu Glu He Pro Leu Thr Val Thr His Pro Val Val Arg Asn Ala 
1110 1115 1120 

CTG TTT TGC CTG GAG TCA GCC TGG AAG ACA GCA CAA GAA GGG GAC CAT 3463 
Leu Phe Cys Leu Glu Ser Ala Trp Lys Thr Ala Gin Glu Gly Asp His 
1125 1130 1135 1140 

GGC AGC CAT GTA TAT ACC AAA GCA CTG CTG GCC TAT GCT TTT GCC CTG 3511 
Gly Ser His Val Tyr Thr Lys Ala Leu Leu Ala Tyr Ala Phe Ala Leu 
1145 1150 1155 

GCA GGT AAC CAG GAC AAG AGG AAG GAA GTA CTC AAG TCA CTT AAT GAG 3559 
Ala Gly Asn Gin Asp Lys Arg Lys Glu Val Leu Lys Ser Leu Asn Glu 
1160 1165 1170 

GAA GCT GTG AAG AAA GAC AAC TCT GTC CAT* TGG GAG CGC CCT CAG AAA 3607 
Glu Ala Val Lys Lys Asp Asn Ser Val His Trp Glu Arg Pro Gin Lys 
1175 1180 1185 

CCC AAG GCA CCA GTG GGG CAT TTT TAC GAA CCC CAG GCT CCC TCT GCT 3655 
Pro Lys Ala Pro Val Gly His Phe Tyr Glu Pro Gin Ala Pro Ser Ala 
1190 1195 1200 

GAG GTG GAG ATG ACA TCC TAT GTG CTC CTC GCT TAT CTC ACG GCC CAG 3703 
Glu Val Glu Met Thr Ser Tyr Val Leu Leu Ala Tyr Leu Thr Ala Gin 
1205 1210 1215 1220 
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CCA GCC CCA ACC TCG GAG GAC CTG ACC TCT GCA ACC AAC ATC GTG AAG 3751 
Pro Ala Pro Thr Ser Glu Asp Leu Thr Ser Ala Thr Asn He Val Lys 
1225 1230 1235 

TGG ATC ACG AAG CAG CAG AAT GCC CAG GGC GGT TTC TCC TCC ACC CAG 3799 
Trp He Thr Lys Gin Gin Asn Ala Gin Gly Gly Phe Ser Ser Thr Gin 
1240 1245 1250 

GAC ACA GTG GTG GCT CTC CAT GCT CTG TCC AAA TAT GGA GCC GCC ACA 3847 
Asp Thr Val Val Ala Leu His Ala Leu Ser Lys Tyr Gly Ala Ala Thr 
1255 1260 1265 

TTT ACC AGG ACT GGG AAG GCT GCA CAG GTG ACT ATC CAG TCT TCA GGG 3895 
Phe Thr Arg Thr Gly Lys Ala Ala Gin Val Thr He Gin Ser Ser Gly 
1270 1275 1280 

ACA TTT TCC AGC AAA TTC CAA GTG GAC AAC AAC AAT CGC CTG TTA CTG 3943 
Thr Phe Ser Ser Lys Phe Gin Val Asp Asn Asn Asn Arg Leu Leu Leu 
1285 1290 1295 1300 

CAG CAG GTC TCA TTG CCA GAG CTG CCT GGG GAA TAC AGC ATG AAA GTG 3991 
Gin Gin Val Ser Leu Pro Glu Leu Pro Gly Glu Tyr Ser Met Lys Val 
1305 1310 1315 

ACA GGA GAA GGA TGT GTC TAC CTC CAG ACC TCC TTG AAA TAC AAT ATT 4039 
Thr Gly Glu Gly Cys Val ,Tyr Leu Gin Thr Ser Leu Lys Tyr Asn He 
1320 1325 1330 

CTC CCA GAA AAG GAA GAG TTC CCC TTT GCT TTA GGA GTG CAG ACT CTG 4087 
Leu Pro Glu Lys Glu Glu Phe Pro Phe Ala Leu Gly Val Gin Thr Leu 
1335 1340 1345 

CCT CAA ACT TGT GAT GAA CCC AAA GCC CAC ACC AGC TTC CAA ATC TCC 4135 
Pro Gin Thr Cys Asp Glu Pro Lys Ala His Thr Ser Phe Gin He Ser 
1350 1355 1360 

CTA AGT GTC AGT TAC ACA GGG AGC CGC TCT GCC TCC AAC ATG GCG ATC 4183 
Leu Ser Val Ser Tyr Thr Gly Ser Arg Ser Ala Ser Asn Met Ala He 
1365 1370 1375 1380 

GTT GAT GTG AAG ATG GTC TCT GGC TTC ATT CCC CTG AAG CCA ACA GTG 4231 
Val Asp Val Lys Met Val Ser Gly Phe He Pro Leu Lys Pro Thr Val 
1385 1390 1395 
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AAA ATG GTT GAA AGA TCT AAC CAT 6TG AGG CGG ACA GAA GTC AGC AGC 4279 
Lys Met Leu Glu Arg Ser Asn His Val Ser Arg Thr Glu Val Ser Ser 
1400 1405 1410 

AAC CAT GTC TTG ATT TAC CTT GAT AAG GTG TCA AAT CAG ACA CTG AGC 4327 
Asn His Val Leu lie Tyr Leu Asp Lys Val Ser Asn Gin Thr Leu Ser 

1420 1425 



TTG TTC TTC ACG GTT CTG CAA GAT GTC CCA GTA AGA GAT CTC AAA CCA 4375 
Leu Phe Phe Thr Val Leu Gin Asp Val Pro Val Arg Asp Leu Lys Pro 
1430 1435 1440 

GCC ATA GTG AAA GTC TAT GAT TAC TAC GAG ACG GAT GAG TTT GCA ATC 4423 
Ala He Val Lys Val Tyr Asp Tyr Tyr Glu Thr Asp Glu Phe Ala He 

1450 1455 1460 



GCT GAG TAC AAT GCT CCT TGC AGC AAA GAT CTT GGA AAT GCT TGAAGACCA 4474 
Ala Glu Tyr Asn Ala Pro Cys Ser Lys Asp Leu Gly Asn Ala 
1465 1470 1 

CAAGGCTGAA AAGTGCTTTG CTGGAGTCCT GTTCTCTGAG CTCCACAGAA GACACGTGTT 4534 
TTTGTATCTT TAAAGACTTG ATGAATAAAC ACTTTTTCTG GTC 4577 
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Ser Val Ser Gly Lys Pro Gin Tyr Met Val Leu Val Pro Ser Leu Leu 
1 5 10 15 

His Thr Glu Thr Thr Glu Lys Gly Cys Val Leu Leu Ser Tyr Leu Asn 

20 25 30 

Glu Thr Val Thr Val Ser Ala Ser Leu Glu Ser Val Arg Gly Asn Arg 

35 40 45 

Ser Leu Phe Thr Asp Leu Glu Ala Glu Asn Asp Val Leu His Cys Val 

50 55 60 

Ala Phe Ala Val Pro Lys Ser Ser Ser Asn Glu Glu Val Met Phe Leu 
65 70 75 80 

Thr Val Gin Val Lys Gly Pro Thr Gin Glu Phe Lys Lys Arg Thr Thr 

85 90 95 

Val Met Val Lys Asn Glu Asp Ser Leu Val Phe Val Gin Thr Asp Lys 

100 105 110 

Ser lie Tyr Lys Pro Gly Gin Thr Val Lys Phe Arg Val Val Ser Met 

115 120 125 

Asp Glu Asn Phe His Pro Leu Asn Glu Leu lie Pro Leu Val Tyr lie 

130 135 140 

Gin Asp Pro Lys Gly Asn s Arg He Ala Gin Trp Gin Ser Phe Gin Leu 
145 150 155 160 

Glu Gly Gly Leu Lys Gin Phe Ser Phe Pro Leu Ser Ser Glu Pro Phe 

165 170 175 

Gin Gly Ser Tyr Lys Val Val Val Gin Lys Lys Ser Gly Gly Arg Thr 

180 185 190 

Glu His Pro Phe Thr Val Glu Glu Phe Val Leu Pro Lys Phe Glu Val 

195 200 205 

Gin Val Thr Val Pro Lys He He Thr He Leu Glu Glu Glu Met Asn 

210 215 220 

Val Ser Val Cys Gly Leu Tyr Thr Tyr Gly Lys Pro Val Pro Gly His 
225 230 235 240 

Val Thr Val Ser lie Cys Arg Lys Tyr Ser Asp Ala Ser Asp Cys His 

245 250 255 

Gly Glu Asp Ser Gin Ala Phe Cys Glu Lys Phe Ser Gly Gin Leu Asn 

260 265 270 

Ser His Gly Cys Phe Tyr Gin Gin Val Lys Thr Lys Val Phe Gin Leu 

275 280 285 

Lys Arg Lys Glu Tyr Glu Met Lys Leu His Thr Glu Ala Gin lie Gin 

290 295 300 

Glu Glu Gly Thr Val Val Glu Leu Thr Gly Arg Gin Ser Ser Glu He 



305 



310 



315 



320 
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Thr Arg Thr lie Thr Lys Leu Ser Phe Val Lys Val Asp Ser His Phe 

325 330 335 

Arg Gin Gly He Pro Phe Phe Gly Gin Val Arg Leu Val Asp Gly Lys 

340 345 350 

Gly Val Pro He Pro Asn Lys Val lie Phe He Arg Gly Asn Glu Ala 

355 360 365 

Asn Tyr Tyr Ser Asn Ala Thr Thr Asp Glu His Gly Leu Val Gin Phe 

370 375 380 

Ser lie Asn Thr Thr Asn Val Met Gly Thr Ser Leu Thr Val Arg Val 
385 390 395 400 

Asn Tyr Lys Asp Arg Ser Pro Cys Tyr Gly Tyr Gin Trp Val Ser Glu 

405 410 415 

Glu His Glu Glu Ala His His Thr Ala Tyr Leu Val Phe Ser Pro Ser 

420 425 . 430 

Lys Ser Phe Val His Leu Glu Pro Met Ser His Glu Leu Pro Cys Gly 

435 440 445 

His Thr Gin Thr Val Gin Ala His Tyr He Leu Asn Gly Gly Thr Leu 

450 455 460 

Leu Gly Leu Lys Lys Leu Ser Phe Tyr Tyr Leu He Met Ala Lys Gly 
465 470 475 480 

Gly He Val Arg Thr Gly Thr His Gly Leu Leu Val Lys Gin Glu Asp 

485 490 495 

Met Lys Gly His Phe Ser He Ser He Pro Val Lys Ser Asp lie Ala 

500 505 510 

Pro Val Ala Arg Leu Leu He Tyr Ala Val Leu Pro Thr Gly Asp Val 

515 520 525 

He Gly Asp Ser Ala Lys Tyr Asp Val Glu Asn Cys Leu Ala Asn Lys 

530 535 540 

Val Asp Leu Ser Phe Ser Pro Ser Gin Ser Leu Pro Ala Ser His Ala 
545 550 555 560 

His Leu Arg Val Thr Ala Ala Pro Gin Ser Val Cys Ala Leu Arg Ala 

565 570 575 

Val Asp Gin Ser Val Leu Leu Met Lys Pro Asp Ala Glu Leu Ser Ala 

580 585 590 

Ser Ser Val Tyr Asn Leu Leu Pro Glu Lys Asp Leu Thr Gly Phe Pro 

595 600 605 

Gly Pro Leu Asn Asp Gin Asp Asp Glu Asp Cys lie Asn Arg His Asn 

610 615 620 

Val Tyr He Asn Gly He Thr Tyr Thr Pro Val Ser Ser Thr Asn Glu 
625 ' 630 635 640 

Lys Asp Met Tyr Ser Phe Leu Glu Asp Met Gly Leu Lys Ala Phe Thr 
645 650 655 
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Leu Asn Tyr Lys His Tyr Asp Sly Ser Tyr Ser Thr Phe Gly Glu Arg 

995 1000 1005 

Tyr Gly Arg Asn Gin Gly Asn Thr Trp Leu Thr Ala Phe Val Leu Lys 

1010 1015 1020 

Thr Phe Ala Gin Ala Arg Ala Tyr He Phe He Asp Glu Ala His lie 
025 1030 1035 1040 

Thr Gin Ala Leu lie Trp Leu Ser Gin Arg Gin Lys Asp Asn Gly Cys 

1045 1050 1055 

Phe Arg Ser Ser Gly Ser Leu Leu Asn Asn Ala He Lys Gly Gly Val 

1060 1065 1070 

Glu Asp Glu Val Thr Leu Ser Ala Tyr lie Thr He Ala Leu Leu Glu 

1075 ^ 1080 1085 

lie Pro Leu Thr Val Thr His Pro Val Val Arg Asn Ala Leu Phe Cys 

1090 1095 1100 

Leu Glu Ser Ala Trp Lys Thr Ala Gin Glu Gly Asp His Gly Ser His 
105 1110 1115 1120 

Val Tyr Thr Lys Ala Leu Leu Ala Tyr Ala Phe Ala Leu Ala Gly Asn 

1125 1130 1135 

Gin Asp Lys Arg Lys Glu Val Leu Lys Ser Leu Asn Glu Glu Ala Val 

1140 1145 1150 

Lys Lys Asp Asn Ser Val His Trp Glu Arg Pro Gin Lys Pro Lys Ala 

1155 1160 1165 

Pro Val Gly His Phe Tyr Glu Pro Gin Ala Pro Ser Ala Glu Val Glu 

1170 1175 1180 

Met Thr Ser Tyr Val Leu Leu Ala Tyr Leu Thr Ala Gin Pro Ala Pro 
185 1190 1195 1200 

Thr Ser Glu Asp Leu Thr Ser Ala Thr Asn lie Val Lys Trp He Thr 

1205 1210 1215 

Lys Gin Gin Asn Ala Gin Gly Gly Phe Ser Ser Thr Gin Asp Thr Val 

1220 1225 1230 

Val Ala Leu His Ala Leu Ser Lys Tyr Gly Ala Ala Thr Phe Thr Arg 

1235 1240 1245 

Thr Gly Lys Ala Ala Gin Val Thr lie Gin Ser Ser Gly Thr Phe Ser 

1250 1255 1260 

Ser Lys Phe Gin Val Asp Asn Asn Asn Arg Leu Leu Leu Gin Gin Val 
265 1270 1275 1280 

Ser Leu Pro Glu Leu Pro Gly Glu Tyr Ser Met Lys Val Thr Gly Glu 

1285 1290 1295 

Gly Cys Val lyr ,Leu Gin Thr Ser Leu Lvs Tvr Asn lie Leu Pro Glu 



Lys I 
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SEQUENCE LISTING 



<110> Antigenics, Inc. 

<120> COMPLEXES OF ALPHA (2) MACROGLOBULIN AND ANTIGENIC 
MOLECULES FOR IMMUNOTHERAPY 

<130> 8449-178-228 

<150> 09/625,139 
<151> 2000-07-25 

<150> 60/209,266 
<151> 2000-06-02 

<160> 5 

<170> FastSEQ for Windows Version 3.0 

<210> 1 

<211> 14849 

<212> DNA 

<213> Mus musculus 

<400> 1 

cgctgctccc cgccagtgca ctgaggaggc ggaaacgggg gagcccctag tgctccatca 60 

ggcccctacc aaggcacccc catcgggtcc acgcccccca ccccccaccc cgcctcctcc 120 

caattgtgca tttttgcagc cggagtcggc tccgagatgg ggctgtgagc ttcgccctgg 180 

gagggggaga ggagcgagga gtaaagcagg ggtgaagggt tcgaatttgg gggcaggggg 240 

cgcacccgcg tcagcaggcc cttcccaggg ggctcggaac tgtaccattt cacctatgcc 300 

cctggttcgc tttgcttaag gaaggataag atagaagagt cggggagagg aagataaagg 360 

gggacccccc aattgggggg ggcgaggaca agaagtaaca ggaccagagg gtgggggctg 420 

ctgtttgcat cggcccacac catgctgacc ccgccgttgc tgctgctcgt gccgctgctt 480 

tcagctctgg tctccggggc cactatggat gcccctaaaa cttgcagccc taagcagttt 540 

gcctgcagag accaaatcac ctgtatctca aagggctggc ggtgtgacgg tgaaagagat 600 

tgccccgacg gctctgatga agcccctgag atctgtccac agagtaaagc ccagagatgc 660 

ccgccaaatg agcacagttg tctggggact gagctatgtg tccccatgtc tcgtctctgc 720 

aacgggatcc aggactgcat ggatggctca gacgagggtg ctcactgccg agagctccga 780 

gccaactgtt ctcgaatggg ttgtcaaeac cattgtgtac ctacacccag tgggcccacg 840 

tgctactgta acagcagctt ccagctcgag gcagatggca agacgtgcaa agattttgac 900 

gagtgttccg tgtatggcac ctgcagccag ctttgcacca acacagatgg ctccttcaca 960 

tgtggctgtg ttgaaggcta cctgctgcaa ccggacaacc gctcctgcaa ggccaagaat 1020 

gagccagtag atcggccgcc agtgctactg attgccaact ctcagaacat cctagctacg 1080 

tacctgagtg gggcccaagt gtctaccatc acacccacca gcacccgaca aaccacggcc 1140 

atggacttca gttatgccaa tgagaccgta tgctgggtgc acgttgggga cagtgctgcc 1200 

cagacacagc tcaagtgtgc ccggatgcct ggcctgaagg gctttgtgga tgagcatacc 1260 

atcaacatct ccctcagcct gcaccacgtg gagcagatgg caatcgactg gctgacggga 1320 

aacttctact ttgtcgacga cattgacgac aggatctttg tctgtaaccg aaacggggac 1380 

acctgtgtca ctctgctgga cctggaactc tacaacccca aaggcatcgc cttggacccc 1440 

gccatgggga aggtgttctt cactgactac gggcagatcc caaaggtgga gcgctgtgac 1500 

atggatggac agaaccgcac caagctggtg gatagcaaga tcgtgtttcc acacggcatc- 1560 

accctggacc tggtcagccg cctcgtctac tgggcggacg cctacctaga ctacatcgag 1620 

gtggtagact acgaagggaa gggtcggcag accatcatcc aaggcatcct gatcgagcac 1680 

ctgtacggcc tgaccgtgtt tgagaactat ctctacgcca ccaactcgga caatgccaac 1740 

acgcagcaga agacgagcgt gatccgagtg aaccggttca acagtactga gtaccaggtc 1800 

gtcacccgtg tggacaaggg tggtgccctg catatctacc accagcgacg ccagccccga 1860 

gtgcggagtc acgcctgtga gaatgaccag tacgggaagc caggtggctg ctccgacatc 1920 



1 



WO 01/91787 



PCT/US01/18047 



tgcctcctgg ccaacagtca caaggcaagg acctgcaggt gcaggtctgg cttcagcctg 1980 

ggaagtgatg ggaagtcttg taagaaacct gaacatgagc tgttcctcgt gtatggcaag 2040 

ggecgaccag gcatcattag aggcatggac atgggggcca aggtcccaga tgagcacatg 2100 

atccccatcg agaaccttat gaatccacgc gctctggact tccacgccga gaccggcttc 2160 

atctactttg ctgacaccac cagctacctc attggccgcc agaaaattga tggcacggag 2220 

agagagacta tcctgaagga tggcatccac aatgtggagg gcgtagccgt ggactggatg 2280 

ggagacaatc tttactggac tgatgatggc cccaagaaga ccattagtgt ggccaggctg 2340 

gagaaagccg ctcagacccg gaagactcta attgagggca agatgacaca ccccagggcc 2400 

attgtagtgg atccactcaa tgggtggatg tactggacag actgggagga ggaccccaag 2460 

gacagtcggc gagggcggct cgagagggct tggatggacg gctcacaccg agatatcttt 2520 

gtcacctcca agacagtgct ttggcccaat gggctaagcc tggatatccc agccggacgc 2580 

ctctactggg tggatgcctt ctatgaccga attgagacca tactgctcaa tggcacagac 2640 

cggaagattg tatatgaggg tcctgaactg aatcatgcct tcggcctgtg tcaccatggc 2700 

aactacctct tttggaccga gtaccggagc ggcagcgtct accgcttgga acggggcgtg 2760 

gcaggcgcac cgcccactgt gacccttctg cgcagcgaga gaccgcctat ctttgagatc 2820 

cgaatgtacg acgcgcacga gcagcaagtg ggtaccaaca aatgccgggt aaataacgga 2880 

ggctgcagca gcctgtgcct cgccaccccc gggagccgcc agtgtgcctg tgccgaggac 2940 

caggtgttgg acacagatgg tgtcacctgc ttggcgaacc catcctacgt gcccccaccc 3000 

cagtgccagc cgggccagtt tgcctgtgcc aacaaccgct gcatccagga gcgctggaag 3060 

tgtgacggag acaacgactg tctggacaac agcgatgagg ccccagcact gtgccatcaa 3120 

cacacctgtc cctcggaccg attcaagtgt gagaacaacc ggtgtatccc caaccgctgg 3180 

ctctgtgatg gggataatga ttgtggcaac agcgaggacg aatccaatgc cacgtgctca 3240 

gcccgcacct gtccacccaa ccagttctcc tgtgccagtg gccgatgcat tcctatctca 3300 

tggacctgtg atctggatga tgactgtggg gaccggtccg atgagtcagc ctcatgcgcc 3360 

taccccacct gcttccccct gactcaattt acctgcaaca atggcagatg tattaacatc 3420 

aactggcggt gtgacaacga caatgactgt ggggacaaca gcgacgaagc cggctgcagt 3480 

cactcctgct ccagtaccca gttcaagtgc aacagtggca gatgcatccc cgagcactgg 354 0 

acgtgtgatg gggacaatga ttgtggggac tacagcgacg agacacacgc caactgtacc 3600 

aaccaggcta caagacctcc tggtggctgc cactcggatg agttccagtg cccgctagat 3660 

ggcctgtgca tccccctgag gtggcgctgc gacggggaca ccgactgcat ggattccagc 3720 

gatgagaaga gctgtgaggg cgtgacccat gtttgtgacc cgaatgtcaa gtttggctgc 3780 

aaggactccg cccggtgcat cagcaaggcg tgggtgtgtg atggcgacag cgactgtgaa 3840 

gataactccg acgaggagaa ctgtgaggcc ctggcctgca ggccaccctc ccatccctgc 3900 

gccaacaaca cctctgtctg cctgcctcct gacaagctgt gcgacggcaa ggatgactgt 3960 

ggagacggct cggatgaggg cgagctctgt gaccagtgtt ctctgaataa tggtggctgt 4020 

agtcacaact gctcagtggc ccctggtgaa ggcatcgtgt gctcttgccc tctgggcatg 4080 

gagctgggct ctgacaacca cacctgccag atccagagct actgtgccaa gcacctcaaa 4140 

tgcagccaga agtgtgacca gaacaagttc agtgtgaagt gctcctgcta cgagggctgg 4200 

gtcttggagc ctgacgggga aacgtgccgc agtctggatc ccttcaaact gt teat cat c 42 60 

ttctccaacc gecacgagat caggegcatt gaccttcaca agggggacta cagcgtccta 432 0 

gtgcctggcc ,tgcgcaacac tattgecctg gacttccacc teagecagag tgccctctac 4380 

tggaccgacg eggtagagga caagatctac cgtgggaaac tcctggacaa cggagccctg 4440 

accagctttg aggtggtgat tcagtatggc ttggccacac cagagggect ggctgtagat 4500 

tggattgcag gcaacatcta ctgggtggag agcaacctgg accagatcga agtggccaag 4560 

ctggacggaa ccctccgaac cactctgctg gcgggtgaca ttgagcaccc gagggecate 4620 

gctctggacc ctegggatgg gattctgttt tggacagact gggatgccag cctgccacga 4680 

atcgaggctg catccatgag tggagctggc cgccgaacca tccaccggga gacaggctct 4740 

999g9 ct 9 c 9 ccaatgggct caccgtggat tacctggaga agcgcatcct ctggattgat 4 800 

gctaggtcag atgecatcta ttcagcccgg tatgaegget ccggccacat ggaggtgctt 4860 

eggggacacg agttcctgtc acacccattt gccgtgacac tgtacggtgg ggaggtgtac 492 0 

tggaccgact ggcgaacaaa tacactggct aaggecaaca agtggactgg ccacaacgtc 4980 

accgtggtac agaggaccaa cacccagccc ttcgacctgc aggtgtatca cccttcccgg 5040 

cagcccatgg ctccaaaccc atgtgaggcc aatggcggcc ggggcccctg ttcccatctg 5100 

tgcctcatca actacaaccg gaccgtctcc tgggcctgtc cccacctcat gaagctgeae 5160 

aaggacaaca ccacctgcta tgagtttaag aagttcctgc tgtacgcacg tcagatggag 5220 

ateeggggeg tggacctgga tgccccgtac tacaattata tcatctcctt cacggtgcct 5280 

gatatcgaca atgtcacggt gctggactat gatgeccgag ageagegagt ttactggtct 5340 

gatgtgcgga ctcaagccat caaaagggca tttatcaacg gcactggcgt ggagaccgtt 5400 
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gtctctgcag acfetgcccaa cgcccacggg ctggctgtgg actgggtctc ccgaaatctg 5460 

ttttggacaa gttacgacac caacaagaag cagattaacg tggcccggct ggacggctcc 5520 

ttcaagaatg cggtggtgca gggcctggag cagccccacg gcctggtcgt ccacccgctt 5580 

cgtggcaagc tctactggac tgatggggac aacatcagca tggccaacat ggatgggagc 5640 

aaccacactc tgctcttcag tggccagaag ggccctgtgg ggttggccat tgacttccct 5700 

gagagcaaac tctactggat cagctctggg aaccacacaa tcaaccgttg caatctggat 5760 

gggagcgagc tggaggtcat cgacaccatg cggagccagc tgggcaaggc cactgccctg 5 820 

gccatcatgg gggacaagct gtggtgggca gatcaggtgt cagagaagat gggcacgtgc 5880 

aacaaagccg atggctctgg gtccgtggtg ctgcggaaca gtaccacgtt ggttatgcac 5940 

atgaaggtgt atgacgagag catccagcta gagcatgagg gcaccaaccc ctgcagtgtc 6000 

aacaacggag actgttccca gctctgcctg ccaacatcag agacgactcg ctcctgtatg 6060 

tgtacagccg gttacagcct ccggagcgga cagcaggcct gtgagggtgt gggctctttt 6120 

ctcctgtact ctgtacatga gggaattcgg gggattccac tagatcccaa tgacaagtcg 6180 

gatgccctgg tcccagtgtc cggaacttca ctggctgtcg gaatcgactt ccatgccgaa 624 0 

aatgacacta tttattgggt ggatatgggc ctaagcacca tcagcagggc caagcgtgac 6300 

cagacatggc gagaggatgt ggtgaccaac ggtattggcc gtgtggaggg catcgccgtg 6360 

gactggatcg caggcaacat atactggacg gaccagggct tcgatgtcat cgaggttgcc 6420 

cggctcaatg gctcttttcg ttatgtggtc atttcccagg gtctggacaa gcctcgggcc 6480 

atcactgtcc acccagagaa ggggtacttg ttctggaccg agtggggtca ttacccacgt 6540 

attgagcggt ctcgccttga tggcacagag agagtggtgt tggttaatgt cagcatcagc 6600 

tggcccaatg gcatctcagt agactatcag ggcggcaagc tctactggtg tgatgctcgg 6660 

atggacaaga tcgagcgcat cgacctggaa acgggcgaga accgggaggt ggtcctgtcc 672 0 

agcaataaca tggatatgtt ctccgtgtcc gtgtttgagg acttcatcta ctggagtgac 6780 

agaactcacg ccaatggctc catcaagcgc ggctgcaaag acaatgctac agactccgtg 684 0 

cctctgagga caggcattgg tgttcagctt aaagacatca aggtcttcaa cagggacagg 6900 

cagaagggta ccaatgtgtg cgcggtagcc aacggcgggt gccagcagct ctgcttgtat 6960 

cggggtggcg gacagcgagc ctgtgcctgt gcccacggga tgctggcaga agacggggcc 7020 

tcatgccgag agtacgctgg ctacctgctc tactcagagc ggaccatcct caagagcatc 7080 

cacctgtegg atgagcgtaa cctcaacgca ccggtgcagc cctttgaaga ccccgagcac 714 0 

atgaaaaatg tcatcgccct ggcctttgac taccgagcag gcacctcccc ggggacccct 7200 

aaccgcatct tcttcagtga catccacttt gggaacatcc agcagatcaa tgacgatggc 7260 

tcgggcagga ccaccatcgt ggaaaatgtg ggctctgtgg aaggcctggc ctatcaccgt 7320 

ggctgggaca cactgtactg gacaagctac accacatcca ccatcacccg ccacaccgtg 73 80 

gaccagactc gcccaggggc cttcgagagg gagacagtca tcaccatgtc cggagacgac 744 0 

cacccgagag cctttgtgct ggatgagtgc cagaacctga tgttctggac caattggaac 7500 

gagctccatc caagcatcat gcgggcagcc ctatccggag ccaacgtcct gaccctcatt 7560 

gagaaggaca tccgcacgcc caatgggttg gccatcgacc accgggcgga gaagctgtac 7620 

ttctcggatg ccaccttgga caagatcgag cgctgcgagt acgacggctc ccaccgctat 7680 

gtgatcctaa agtcggagcc cgtccacccc tttgggttgg cggtgtacgg agagcacatt 774 0 

ttctggactg actgggtgcg gcgggctgtg cagcgagcca acaagtatgt gggcagcgac 780 0 

atgaagctgc ttcgggtgga cattccccag caacccatgg gcatcatcgc cgtggccaat 7860 

gacaccaaca gctgtgaact ctccccctgc cgtatcaaca atggaggctg ccaggatctg 7920 

tgtctgctca cccaccaagg ccacgtcaac tgttcctgtc gagggggccg gatcctccag 7980 

gaggacttca cctgccgggc tgtgaactcc tcttgtcggg cacaagatga gtttgagtgt 8040 

gccaatgggg aatgtatcag cttcagcctc acctgtgatg gcgtctccca ctgcaaggac 8100 

aagtccgatg agaagccctc ctactgcaac tcacgccgct gcaagaagac tttccgccag 8160 

tgtaacaatg gccgctgtgt atccaacatg ctgtggtgca atggggtgga ttactgtggg 8220 

gatggctctg atgagatacc ttgcaacaag actgcctgtg gtgtgggtga gttccgctgc 8280 

cgggatgggt cctgcatcgg gaactccagt cgctgcaacc agtttgtgga ttgtgaggat 8340 

gcctcggatg agatgaattg cagtgccaca gactgcagca gctatttccg cctgggcgtg 8400 

aaaggtgtcc tcttccagcc gtgcgagcgg acatccctgt gctacgcacc tagctgggtg 84 60 

tgtgatggcg ccaacgactg tggagactac agcgatgaac gtgactgtcc aggtgtgaag 8520 

cgccctaggt gceegctcaa ttactttgcc tgccccagcg ggcgctgtat ccccatgagc 8580 

tggacgtgtg acaaggagga tgactgtgag aacggcgagg atgagaccca ctgcaacaag 864 0 

ttctgctcag aggcacagtt cgagtgccag aaccaccggt gtatctccaa gcagtggctg 8700 

tgtgacggta gcgatgattg cggggatggc tccgatgagg cagctcactg tgaaggcaag 8760 

acatgtggcc cctcctcctt ctcctgtccc ggcacccacg tgtgtgtccc tgagcgctgg 882 0 

ctctgtgatg gcgacaagga ctgtaccgat ggcgcggatg agagtgtcac tgctggctgc 8880 
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ctgtacaaca gcacctgtga tgaccgtgag ttcatgtgcc agaaccgctt gtgtattccc 8940 

aagcatttcg tgtgcgacca tgaccgtgac tgtgctgatg gctctgatga atcccctgag 9000 

tgtgagtacc caacctgcgg gcccaatgaa ttccgctgtg ccaatgggcg ttgtctgagc 9060 

tcccgtcagt gggaatgtga tggggagaat gactgtcacg accacagcga tgaggctccc 9120 

aagaacccac actgcaccag cccagagcac aaatgcaatg cctcatcaca gttcctgtgc 9180 

agcagcgggc gctgcgtggc tgaggcgttg ctctgcaacg gccaggacga ctgtggggac 9240 

ggttcagacg aacgcgggtg ccatgtcaac gagtgtctca gccgcaagct cagtggctgc 9300 

agtcaggact gcgaggacct caagataggc tttaagtgcc gctgtcgccc gggcttccgg 9360 

ctaaaggacg atggcaggac ctgtgccgac ctggatgagt gcagcaccac cttcccctgc 9420 

agccagctct gcatcaacac ccacggaagt tacaagtgtc tgtgtgtgga gggctatgca 9480 

ccccgtggcg gtgaccccca cagctgcaaa gctgtgaccg atgaggagcc atttctcatc 9540 

tttgccaacc ggtactacct gcggaagctc aacctggacg gctccaacta cacactgctt 9600 

aagcagggcc tgaacaatgc ggtcgccttg gcatttgact accgagagca gatgatctac 9660 

tggacgggcg tgaccaccca gggcagcatg attcgcagga tgcacctcaa cggcagcaac 972 0 

gtgcaggttc tgcaccggac gggccttagt aacccagatg ggctcgctgt ggactgggtg 9780 

ggtggcaacc tgtactggtg tgacaagggc agagatacca ttgaggtgtc caagcttaac 9840 

ggggcctatc ggacagtgct ggtcagctct ggcctccggg agcccagagc tctggtagtg 9900 

gatgtacaga atgggtacct gtactggaca gactggggtg accactcact gatcggccgg 9960 

attggcatgg atggatctgg ccgcagcatc atcgtggaca ctaagatcac atggcccaat 1002 0 

ggcctgaccg tggactacgt cacggaacgc atctactggg ctgacgcccg tgaggactac 10080 

atcgagttcg ccagcctgga tggctccaac cgtcacgttg tgctgagcca agacatccca 10140 

cacatctttg cgctgaccct atttgaagac tacgtctact ggacagactg ggaaacgaag 10200 

tccatcaacc gggcccacaa gaccacgggt gccaacaaaa cactcctcat cagcaccctg 10260 

caccggccca tggacttaca tgtattccac gccctgcgcc agccagatgt gcccaatcac 10320 

ccctgcaaag tcaacaatgg tggctgcagc aacctgtgcc tgctgtcccc tgggggtggt 10380 

cacaagtgcg cctgccccac caacttctat ctgggtggcg atggccgtac ctgtgtgtcc 10440 

aactgcacag caagccagtt tgtgtgcaaa aatgacaagt gcatcccctt ctggtggaag 10500 

tgtgacacgg aggacgactg tggggatcac tcagacgagc ctccagactg tcccgagttc 10560 

aagtgccgcc caggccagtt ccagtgctcc accggcatct gcaccaaccc tgccttcatc 10620 

tgtgatgggg acaatgactg ccaagacaat agtgacgagg ccaattgcga cattcacgtc 10680 

tgcttgccca gccaattcaa gtgcaccaac accaaccgct gcattcctgg catcttccgt 10740 

tgcaatgggc aggacaactg cggggacggc gaggatgagc gggattgccc tgaggtgacc 10800 

tgcgccccca accagttcca gtgctccatc accaagcgct gcatccctcg cgtctgggtc 10860 

tgtgacaggg ataatcactg tgtggacggc agtgatgagc ctgccaactg tacccaaatg 10920 

acctgtggag tggatgagtt ccgctgcaag gattctggcc gctgcatccc cgcgcgctgg 10980 

aagtgtgacg gagaagatga ctgtggggat ggttcagatg agcccaagga agagtgtgat 11040 

gagcgcacct gtgagccata ccagttccgc tgcaaaaaca accgctgtgt cccaggccgt 11100 

tggcaatgtg actacgacaa cgactgcgga gataactcgg acgaggagag ctgcacacct 11160 

cggccctgct ctgagagtga gtttttctgt gccaatggcc gctgcatcgc tgggcgctgg 11220 

aagtgtgatg gggaccatga ctgtgccgac ggctcagacg agaaagactg caccccccgc 11280 

tgtgatatgg accagttcca gtgcaagagt ggccactgca tccccctgcg ctggccgtgt 11340 

gacgcggatg ctgactgtat ggacggcagt gacgaggaag cctgtggcac tggggtgagg 11400 

acctgcccat tggatgagtt tcaatgtaac aacaccttgt gcaagccgct ggcctggaag 11460 

tgtgatggag aggacgactg tggggacaac tcagatgaga accccgagga atgcgcccgg 11520 

ttcatctgcc ctcccaaccg gcctttccgc tgcaagaatg accgagtctg cctgtggatt 11580 

' gggcgccagt gtgatggcgt ggacaactgt ggagatggga ctgacgagga ggactgtgag 11640 ' 

ccccccacgg cccagaaccc ccactgcaaa gacaagaagg agttcctgtg ccgaaaccag 11700 

cgctgtctat catcctccct gcgctgtaac atgttcgatg actgcggcga tggctccgat 11760 

gaagaagatt gcagcatcga ccccaagctg accagctgtg ccaccaatgc cagcatgtgt 11820 

ggggacgaag ctcgttgtgt gcgcactgag aaagctgcct actgtgcctg ccgctcgggc 11880 

ttccatactg tgccgggcca gcccggatgc caggacatca acgagtgcct gcgctttggt 11940 

acctgctctc agctctggaa caaacccaag ggaggccacc tctgcagctg tgcccgcaac 12000 

ttcatgaaga cacacaacac ctgcaaagct gaaggctccg agtaccaggt gctatacatc 12060 

gcggatgaca acgagatccg cagcttgttc ccgggccacc cccactcagc ctacgagcag 12120 

acattccagg gcgatgagag tgtccgcata gatgccatgg atgtccatgt caaggccggc 12180 

cgtgtctact ggactaactg gcacacgggc acaatctcct acaggagcct gccccctgcc 12240 

gcccctccta ccacttccaa ccgccaccgg aggcagatcg accggggtgt cacccacctc 12300 

aatatttcag ggctgaagat gccgaggggt atcgctatcg actgggtggc cgggaatgtg 12360 
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tactggaccg attccggccg agacgtgatt gaggtggcgc aaatgaaggg cgagaaccgc 12420 

aagacgctca tctcgggcat gattgatgag ccccatgcca tcgtggtgga ccctctgagg 12480 

ggcaccatgt actggtcaga ctgggggaac caccccaaga ttgaaacagc agcgatggat 12540 

ggcacccttc gggagactct cgtgcaagac aacattcagt ggcctacagg gctggctgtg 12600 

gactatcaca atgaacggct ctactgggca gatgccaagc tttcggtcat cggcagcatc 12660 

cggctcaaeg gcactgaccc cattgfcggct gctgacagca aacgaggcct aagtcacccc 12720 

ttcagcatcg atgtgtttga agactacatc tacggagtca cttacatcaa taatcgtgtc 12780 

ttcaagatcc acaagtttgg acacagcccc ttgtacaacc taactggggg cctgagccat 12840 

gcctctgatg tagtccttta ccatcaacac aagcagcctg aagtgaccaa cccctgtgac 12900 

cgcaagaaat gcgaatggct gtgtctgctg agccccagcg ggcctgtctg cacctgtccc 12960 

aatggaaaga ggctggataa tggcacctgt gtgcctgtgc cctctccaac accccctcca 13020 

gatgccccta ggcctggaac ctgcactctg cagtgcttca atggtggtag ttgtttcctc 13080 

aacgctcgga ggcagcccaa gtgccgttgc cagccccgtt acacaggcga taagtgtgag 13140 

ctggatcagt gctgggaata ctgtcacaac ggaggcacct gtgcggcttc cccatctggc 13200 

atgcccacgt gccgctgtcc cactggcttc acgggcccca aatgcaccgc acaggtgtgt 13260 

gcaggctact gctctaacaa cagcacctgc accgtcaacc agggcaacca gccccagtgc 13320 

cgatgtctac ctggcttcct gggcgaccgt tgccagtacc ggcagtgctc tggcttctgt 13380 

gagaactttg gcacctgtca gatggctgct gatggctccc gacaatgtcg ctgcaccgtc 13440 

tactttgagg gaccaaggtg tgaggtgaac aagtgtagtc gctgtctcca aggcgcctgt 13500 

gtggtcaata agcagaccgg agatgtcaca tgcaactgca ctgatggccg ggtagccccc 13560 

agttgtctca cctgcatcga tcactgtagc aatggtggct cctgcaccat gaacagcaag 13620 

atgatgcctg agtgccagtg cccgccccat atgacaggac cccggtgcca ggagcaggtt 13680 

gttagtcagc aacagcctgg gcatatggcc tccatcctga tccctctgct gctgcttctc 13740 

ctgctgcttc tggtggctgg cgtggtgttc tggtataagc ggcgagtccg aggggctaag 13800 

ggcttccagc accagcggat gaccaatggg gccatgaatg tggaaattgg aaaccctacc 13860 

tacaagatgt atgaaggtgg agagcccgat gatgtcgggg gcetactgga tgctgatttt 1392 0 

gcccttgacc ctgacaagcc taccaacttc accaacccag tgtatgccac gctctacatg 13980 

gggggccacg gcagccgcca ttccctggcc agcacggacg agaagcgaga actgctgggc 14040 

cggggacctg aagacgagat aggagatccc ttggcatagg gccctgcccc gacggatgtc 14100 

cccagaaagc cccctgccac atgagtcttt caatgaaccc cctccccagc cggcccttct 14160 

ccggccctgc cgggtgtaca aatgtaaaaa tgaaggaatt actttttata tgtgagcgag 14220 

caagcgagca agcacagtat tatctctttg catttccttc ctgcctgctc ctcagtatcc 14280 

cccccatgct gccttgaggg ggcggggagg gctttgtggc tcaaaggtat gaaggagtcc 14340 

acatgttccc taccgagcat acccctggaa gcctggcggc acggcctccc caccaegcct 14400 

gtgcaagaca ctcaacgggg ctccgtgtcc cagctttcct ttccttggct ctctggggtt 14460 
agttcagggg aggtggagtc ctctgctgac cctgtctgga agatttggct ctagctgagg . 14520 

aaggagtctt ttagttgagg gaagtcaccc caaaccccag ctcccacttt caggggcacc 14580 

tctcagatgg ccatgctcag tatcccttcc agacaggccc tcccctctct agcgccccct 14640 

ctgtggctcc tagggctgaa cacattcttt ggtaactgtc ccccaagcct cccatccccc 14700 

tgagggccag gaagagtcgg ggcacaccaa ggaagggcaa gcgggcagcc ccattttggg 14760 

gacgtgaacg ttttaataat ttttgctgaa ttcctttaca actaaataac acagatattg 14820 

ttataaataa aattgtaaaa aaaaaaaaa 14849 



<210> 2 
<211> 4545 
<212> PRT 

<213> Mus musculus 



<400> 2 

Met Leu Thr Pro Pro Leu Leu Leu 

1 5 
Val Ser Gly Ala Thr Met Asp Ala 
20 

Phe Ala Cys Arg Asp Gin lie Thr 
35 40 



Leu Val Pro Leu Leu Ser Ala Leu 

10 15 
Pro Lys Thr Cys Ser Pro Lys Gin 
25 30 
Cys lie Ser Lys Gly frp Arg Cys 
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Cys 


Pro Gin 


Ser 


65 








Leu 


Gly Thr 


Glu 


Gin 


Asp Cys 


Met 








100 


Arg 


Ala Asn 


Cys 






115 




Pro 


Ser 


Gly 


Pro 




130 






Asp 


Glv 


Lys 


Thr 


145 








Cys 


Ser 


Gin 


Leu 


Val 


Glu 


Glv 


Tyr 








180 


Asn 


Glu 


Pro 


Val 






195 




Asn 


He 


Leu 


Ala 




210 






Pro 


Thr 


Ser 


Thr 


225 








Glu 


Thr 


Val 


Cys 


Leu 


Lys 


Cvs 


Ala 








260 


Thr 


lie 


Asn 


He 






275 




Asp 




Leu 


Thr 




290 






He 


Phe 


Val 


Cys 


305 








Leu 


Glu 


Leu 


Tyr 


Lys 


Val 


Phe 


Phe 








340 


Asp 


Met 


Asp 


Gly 






355 




Phe 


Pro 


His 


Gly 




370 






Ala 


Asp 


Ala 


Tyr 


385 








Gly 


Arg 


Gin 


Thr 


Leu 


Thr 


Val 


Phe 








420 


Asn 


Thr 


Gin 


Gin 






435 




Thr 


Glu 


Tyr 


Gin 




450 






He 


Tyr 


His 


Gin 


465 








Asn 


Asp 


Gin 


Tyr 


Ala 


Asn 


Ser 


His 



500 



Asp 


Cys 


Pro 


Asp 






55 




Lys 


Ala 


Gin 


Arg 




70 






Leu 


Cys 


Val 


Pro 


85 








Asp 


Gly 


Ser 


Asp 


Ser 


Arg 


Met 


Gly 








120 


Thr 


Cys 


Tyr 


Cys 






135 




Cys 


Lys 


Asp 


Phe 




150 






Cys 


Thr 


Asn 


Thr 


165 








Leu 


Leu 


Gin 


Pro 


Asp 


Arg 


Pro 


Pro 








200 


Thr 


Tyr 


Leu 


Ser 






215 




Arg 


Gin 


Thr 


Thr 




230 






Trp 


Val 


His 


Val 


245 








Arg 


Met 


Pro 


Gly 


Ser 


Leu 


Ser 


Leu 








280 


Gly 


Asn 


Phe 


Tyr 






295 




Asn 


Arg 


Asn 


Gly 




310 






Ash 


Pro 


Lys 


Gly 


325 








Thr 


Asp 


Tyr 


Gly 


Gin 


Asn 


Arg 


Thr 








360 


He 


Thr 


Leu 


Asp 






375 




Leu 


Asp 


Tyr 


lie 




390 






He 


lie 


Gin 


Gly 


405 








Glu 


Asn 


Tyr 


Leu 


Lys 


Thr 


Ser 


Val 








440 


Val 


Val 


Thr 


Arg 






455 




Arg 


Arg 


Gin 


Pro 




470 






Gly 


Lys 


Pro 


Gly 


485 








Lys 


Ala 


Arg 


Thr 



Gly 


Ser 


Asp 


Glu 








60 


Cys 


pro 


Pro 


Asn 






75 




Met 


Ser 


Arg 


Leu 




90 






Glu 


Gly 


Ala 


His 


105 








Cys 


Gin 


His 


His 


Asn 


Ser 


Ser 


Phe 








140 


Asp 


Glu 


Cys 


Ser 






155 




Asp 


Gly 


Ser 


Phe 




170 






Asp 


Asn 


Arg 


Ser 


185 








val 


Leu 


Leu 


He 


Gly 


Ala 


Gin 


Val 








220 


Ala 


Met 


Asp 


Phe 






235 




Gly 


Asp 


Ser 


Ala 




250 






Leu 


Lys 


Gly 


Phe 


265 








His 


His 


Val 


Glu 


Phe 


Val 


Asp 


Asp 








300 


Asp 


Thr 


Cys 


val 






315 




He 


Ala 


Leu 


Asp 




330 






Gin 


lie 


Pro 


Lys 


345 






• 


Lys 


Leu 


Val 


Asp 


Leu 


Val 


Ser 


Arg 








380 


Glu 


Val 


Val 


Asp 






395 




He 


Leu 


He 


Glu 




410 






Tyr 


Ala 


Thr 


Asn 


425 








He 


Arg 


Val 


Asn 


Val 


Asp 


Lys 


Gly 








460 


Arg 


Val 


Arg 


Ser 






475 




Gly 


Cys 


Ser 


Asp 




490 






Cys 


Arg 


Cys 


Arg 



505 



Ala 


Pro 


Glu 


He 


Glu 


His 


Ser 


Cys 








80 


Cys 


Asn 


Gly 


He 






95 




Cys 


Arg 


Glu 


Leu 




110 






Cys 


Val 


Pro 


Thr 


125 








Gin 


Leu 


Glu 


Ala 


Val 


Tyr 


Gly 


Thr 








160 


Thr 


Cys 


fit w w 

Gly 


Cys 






175 




Cys 


Lys 


Ala 


Lys 




190 






Ala 


Asn 


Ser 


Gin 


205 








Ser 


Thr 


He 


Thr 


Ser 


Tyr 


Ala 


Asn 








240 


TV "I — 

a J. a 


Gin 


Thr 


Gin 






255 




Val 


Asp 


Glu 


His 




270 






Gin 


Met 


Ala 


lie 


285 








lie 


Asp 


Asp 


Arg 


Thr 


Leu 


Leu 


Asp 








320 


Pro 


Ala 


Met 


Gly 






335 




Val 


Glu 


Arg 


Cys 




350 






Ser 


Lys 


lie 


Val 


365 








Leu 


Val 


Tyr 


Trp 


Tyr 


Glu 


Gly 


Lys 








400 


His 


Leu 


Tyr 


Gly 






415 




Ser 


Asp 


Asn 


Ala 




430 






Arg 


Phe 


Asn 


Ser 


445 








Gly 


Ala 


Leu 


His 


His 


Ala 


Cys 


Glu 








480 


He 


Cys 


Leu 


Leu 






495 




Ser 


Gly 


Phe 


Ser 




510 







6 



WO 01/91787 PCT/US01/18047 



Leu Gly Ser Asp 


Gly 


Lys Ser Cys Lys Lys 


Pro Glu His Glu Leu 


Phe 


515 




520 


525 




Leu Val Tyr Gly 


Lys 


Gly Arg Pro Gly lie 


He Arg Gly Met Asp 


Met 


530 




535 


540 




Gly Ala Lys Val 


Pro 


Asp Glu His Met He 


Pro lie Glu Asn Leu 


Met 


545 




550 


555 


560 


As ii Pro Ara Ala 


Leu 


ABp Phe His Ala Glu 


Thr Gly Phe He Tyr 


Phe 




565 


570 


575 




Ala Asn Thr Thr 


Ser 


Tyr Leu lie Gly Arg 


Gin Lys He Asp Gly 


Thr 


580 




585 


590 




Glu Ara GlU Thr 


He 


Leu Lys Asp Gly He 


His Asn Val Glu Gly 


Val 


595 




600 


605 




Ala Val Asp Trp 


Met 


Gly Asp Asn Leu Tyr 


Trp Thr Asp Asp Gly 


Pro 


610 




615 


620 




Lys Lys Thr lie 


Ser 


Val Ala Arg Leu Glu 


Lys Ala Ala Gin Thr 


Arg 


625 




630 


635 


640 


Lys Thr Leu He 


Glu 


Gly Lys Met Thr His 


Pro Arg Ala He Val 


Val 




645 


650 


655 




Asn Pro Leu Asn 


Glv 


Trp Met Tyr Trp Thr 


Asp Trp Glu Glu Asp 


Pro 


660 




665 


670 




Lv<? Ann Ser Ara 


Ara 


Glv Ara Leu Glu Ara 


Ala Trn Met Asn Glv 


Ser 


675 




680 


685 




His Ara Asn He 


Phe 


Val Thr Ser Lys Thr 


Val Leu Trp Pro Asn 


Glv 


690 




695 


700 




"Levi Spt T»eu Asn 

UCU kJCI IICU f\.0£t 


He 


Pro Ala Glv Ara Leu 


Tvr Trn Val Asn Ala 


Phe 


705 




710 


715 


720 


Tyr Asp Arg He 


Glu 


Thr He Leu Leu Asn 


Gly Thr Asp Arg Lys 


He 




725 


730 


735 




Val Tyr Glu Gly 


Pro 


Glu Leu Asn His Ala 


Phe Gly Leu Cys His 


His 


740 




745 


750 




Gly Asn Tyr Leu 


Phe- 


Trp Thr Glu Tyr Arg 


Ser Gly Ser Val Tyr 


Ara 


755 




760 


765 




TjPU Gl n Ara Gl v 


Val 


Ala Glv Ala Pro Pro 


Thr Val Thr Leu Leu 


Ara 


770 




775 


780 




Ser Glu Ara Pro 


Pro 


He Phe Glu He Arg 


Met Tvr Asn Ala His 


Glu 


785 




790 


795 


800 


Gin Gin Val Gly 


Thr 


Asn Lys Cys Arg Val 


Asn Asn Gly Gly Cys 


Ser 




805 


810 


815 




Seir Leu Cys Leu 


Ala 


Thr Pro Gly Ser Arg 


Gin Cys Ala Cys Ala 


Glu 


820 




825 


830 




Asp Gin Val Leu 


Asd 


Thr Asp Gly Val Thr 


Cys Leu Ala Asn Pro 


Ser 


835 




840 


845 




Tyr Val Pro Pro 


Pro 


Gin Cys Gin Pro Gly 


Gin Phe Ala Cys Ala 


Asn 


850 




855 


860 




As ii Ara Cvs He 


Gin 


Glu Ara Trn Lvs Gvs 


Asp Gly Asp Asn Asp 


Cys 


865 




870 


875 


880 


LeU Asn Asn Ser 


Asp 


Glu Ala Pro Ala Leu 


Cys His Gin His Thr 


Cvs 




885 


890 


895 




Pro Ser Asp Arg 


Phe 


Lys Cys Glu Asn Asn 


Arg Cys He Pro Asn 


Ara 


900 




905 


910 




Trp Leu Cys Asp 


Glv 


Asp Asn Asp Cys Gly 


Asn Ser Glu Asp Glu 


Ser 


915 




920 


925 




Asn Ala Thr Cys 


Ser 


Ala Arg Thr Cys Pro 


Pro Asn Gin Phe Ser 


Cys 


930 




935 


940 




Ala Ser Gly Arg 


Cys 


He Pro He Ser Trp 


Thr Cys Asp Leu Asp 


Asp 


945 




950 


955 


960 


Asp Cys Gly Asp 


Arg 


Ser Asp Glu Ser Ala 


Ser Cys Ala Tyr Pro 


Thr 




965 


970 


975 
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Cys Phe Pro Leu Thr Gin Phe Thr Cys Asn Asn Gly Arg Cys lie Asn 

980 985 990 

He Asn Trp Arg Cys Asp Asn Asp Asn Asp Cys Gly Asp Asn Ser Asp 

995 1000 1005 

Glu Ala Gly Cys Ser His Ser Cys Ser Ser Thr Gin Phe Lys Cys Asn 

1010 1015 1020 

Ser Gly Arg Cys He Pro Glu His Trp Thr Cys Asp Gly Asp Asn Asp 
1025 1030 1035 1040 

Cys Gly Asp Tyr Ser Asp Glu Thr His Ala Asn Cys Thr Asn Gin Ala 

1045 1050 1055 

Thr Arg Pro Pro Gly Gly Cys His Ser Asp Glu Phe Gin Cys Pro Leu 

1060 1065 1070 

Asp Gly Leu Cys He Pro Leu Arg Trp Arg Cys Asp Gly Asp Thr Asp 

1075 1080 1085 

Cys Met Asp Ser Ser Asp Glu Lys Ser Cys Glu Gly Val Thr His Val 

1090 1095 1100 

Cys Asp Pro Asn Val Lys Phe Gly Cys Lys Asp Ser Ala Arg Cys He 
1105 1110 1115 1120 

Ser Lys Ala Trp Val Cys Asp Gly Asp Ser Asp Cys Glu Asp Asn Ser 



Asp Glu Glu Asn Cys Glu Ala Leu Ala Cys Arg Pro Pro Ser His Pro 

1140 1145 1150 

Cys Ala Asn Asn Thr Ser Val Cys Leu Pro Pro Asp Lys Leu Cys Asp 

1155 1160 1165 

Gly Lys Asp Asp Cys Gly Asp Gly Ser Asp Glu Gly Glu Leu Cys Asp 

1170 1175 1180 

Gin Cys Ser Leu Asn Asn Gly Gly Cys Ser His Asn Cys Ser Val Ala 
1185 1190 1195 1200 

Pro Gly Glu Gly He Val Cys Ser Cys Pro Leu Gly Met Glu Leu Gly 

1205 1210 1215 

Ser Asp Asn His Thr Cys Gin He Gin Ser Tyr Cys Ala Lys His Leu 

1220 1225 1230 

Lys Cys Ser Gin Lys Cys Asp Gin Asn Lys Phe Ser Val Lys Cys Ser 

1235 1240 1245 

Cys Tyr Glu Gly Trp Val Leu Glu Pro Asp Gly Glu Thr Cys Arg Ser 

1250 1255 1260 

Leu Asp Pro Phe Lys Leu Phe He He Phe Ser Asn Arg His Glu lie 
1265 1270 1275 1280 

Arg Arg He Asp Leu His Lys Gly Asp Tyr Ser Val Leu Val Pro Gly 

1285 1290 1295 

Leu Arg Asn Thr He Ala Leu Asp Phe His Leu Ser Gin Ser Ala Leu 

1300 1305 1310 

Tyr Trp Thr Asp Ala Val Glu Asp Lys He Tyr Arg Gly Lys Leu Leu 

1315 ■ 1320 1325 

Asp Asn Gly Ala Leu Thr Ser Phe Glu Val Val He Gin Tyr Gly Leu 

1330 1335 1340 

Ala Thr Pro Glu Gly Leu Ala Val Asp Trp He Ala Gly Asn He Tyr 
1345 1350 1355 1360 

Trp Val Glu Ser Asn Leu Asp Gin He Glu Val Ala Lys Leu Asp Gly 

1365 1370 1375 

Thr Leu Arg Thr Thr Leu Leu Ala Gly Asp He Glu His Pro Arg Ala 

1380 1385 1390 

He Ala Leu Asp Pro Arg Asp Gly He Leu Phe Trp Thr Asp Trp Asp 

1395 1400 1405 

Ala Ser Leu Pro Arg He Glu Ala Ala Ser Met Ser Gly Ala Gly Arg 

1410 1415 1420 

Arg Thr He His Arg Glu Thr Gly Ser Gly Gly Cys Ala Asn Gly Leu 
1425 1430 1435 1440 



1125 



1130 



1135 
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Thr Val Asp Tyr Leu Glu Lys Arg lie Leu Trp He Asp Ala Arg Ser 

1445 1450 1455 

Asp Ala He Tyr Ser Ala Arg Tyr Asp Gly Ser Gly His Met Glu Val 

1460 1465 1470 

Leu Arg Gly His Glu Phe Leu Ser His Pro Phe Ala Val Thr Leu Tyr 

1475 1480 1485 

Gly Gly Glu Val Tyr Trp Thr Asp Trp Arg Thr Asn Thr Leu Ala Lys 

1490 1495 1500 

Ala Asn Lys Trp Thr Gly His Asn Val Thr Val Val Gin Arg Thr Asn 
1505 1510 1515 1520 

Thr Gin Pro Phe Asp Leu Gin Val Tyr His Pro Ser Arg Gin Pro Met 

1525 1530 1535 

Ala Pro Asn Pro Cys Glu Ala Asn Gly Gly Arg Gly Pro Cys Ser His 

1540 1545 1550 

Leu Cys Leu He Asn Tyr Asn Arg Thr Val Ser Trp Ala Cys Pro His 

1555 1560 1565 

Leu Met Lys Leu His Lys Asp Asn Thr Thr Cys Tyr Glu Phe Lys Lys 

1570 1575 ■ 1580 

Phe Leu Leu Tyr Ala Arg Gin Met Glu He Arg Gly Val Asp Leu Asp 
1585 1590 1595 1600 

Ala Pro Tyr Tyr Asn Tyr He He Ser Phe Thr Val Pro Asp He Asp 

1605 1610 1615 

Asn Val Thr Val Leu Asp Tyr Asp Ala Arg Glu Gin Arg Val Tyr Trp 

1620 1625 1630 

Ser Asp Val Arg Thr Gin Ala He Lys Arg Ala Phe He Asn Gly Thr 

1635 1640 1645 

Gly Val Glu Thr Val Val Ser Ala Asp Leu Pro Asn Ala His Gly Leu 

1650 1655 1660 

Ala Val Asp Trp Val Ser Arg Asn Leu Phe Trp Thr Ser Tyr Asp Thr 
1665 1670 1675 * ^ 1680 

Asn Lys Lys Gin He Asn Val Ala Arg Leu Asp Gly Ser Phe Lys Asn 

1685 1690 1695 

Ala Val Val Gin Gly Leu Glu Gin Pro His Gly Leu Val Val His Pro 

1700 1705 1710 

Leu Arg Gly Lys Leu Tyr Trp Thr Asp Gly Asp Asn He Ser Met Ala 

1715 1720 1725 

Asn Met Asp Gly Ser Asn His Thr Leu Leu Phe Ser Gly Gin Lys Gly 

1730 1735 1740 

Pro Val Gly Leu Ala He Asp Phe Pro Glu Ser Lys Leu Tyr Trp lie 
1745 1750 1755 1760 

Ser Ser Gly Asn His Thr He Asn Arg Cys Asn Leu Asp Gly Ser Glu 

1765 1770 1775 

Leu Glu Val He Asp Thr Met Arg Ser Gin Leu Gly Lys Ala Thr Ala 

1780 1785 1790 

Leu Ala He Met Gly Asp Lys Leu Trp Trp Ala Asp Gin Val Ser Glu 

1795 1800 1805 

Lys Met Gly Thr Cys Asn Lys Ala Asp Gly Ser Gly Ser Val Val Leu 

1810 1815 1820 

Arg Asn Ser Thr Thr Leu Val Met His Met Lys Val Tyr Asp Glu Ser 
1825 1830 1835 1840 

He Gin Leu Glu His Glu Gly Thr Asn Pro Cys Ser Val Asn Asn Gly 

1845 1850 1855 

Asp Cys Ser Gin Leu Cys Leu Pro Thr Ser Glu Thr Thr Arg Ser Cys 

1860 1865 1870 

Met Cys Thr Ala Gly Tyr Ser Leu Arg Ser Gly Gin Gin Ala Cys Glu 

1875 1880 1885 

Gly Val Gly Ser Phe Leu Leu Tyr Ser Val His Glu Gly He Arg Gly 
1890 1895 1900 
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lie Pro Leu Asp Pro Asn Asp Lys Ser Asp Ala Leu Val Pro Val Ser 
1905 1910 1915 1920 

Gly Thr Ser Leu Ala Val Gly He Asp Phe His Ala Glu Asn Asp Thr 

1925 1930 1935 

He Tyx Trp Val Asp Met Gly Leu Ser Thr He Ser Arg Ala Lys Arg 

1940 1945 1950 

Asp Gin Thr Trp Arg Glu Asp Val Val Thr Asn Gly He Gly Arg Val 

1955 1960 1965 

Glu Gly lie Ala Val Asp Trp He Ala Gly Asn He Tyr Trp Thr Asp 

1970 1975 1980 

Gin Gly Phe Asp Val He Glu Val Ala Arg Leu Asn Gly Ser Phe Arg 
1985 1990 1995 2000 

Tyr Val Val He Ser Gin Gly Leu Asp Lys Pro Arg Ala lie Thr Val 

2005 2010 2015 

His Pro Glu Lys Gly Tyr Leu Phe Trp Thr Glu Trp Gly His Tyr Pro 

2020 2025 2030 

Arg He Glu Arg Ser Arg Leu Asp Gly Thr Glu Arg Val Val Leu Val 

2035 2040 2045 

Asn Val Ser He Ser Trp Pro Asn Gly lie Ser Val Asp Tyr Gin Gly 

2050 2055 2060 

Gly Lys Leu Tyr Trp Cys Asp Ala Arg Met Asp Lys He Glu Arg He 
2065 2070 2075 2080 

Asp Leu Glu Thr Gly Glu Asn Arg Glu Val Val Leu Ser Ser Asn Asn 

2085 2090 2095 

Met Asp Met Phe Ser Val Ser Val Phe Glu Asp Phe He Tyr Trp Ser 

2100 2105 2110 

Asp Arg Thr His Ala Asn Gly Ser He Lys Arg Gly Cys Lys Asp Asn 

2115 2120 2125 

Ala Thr Asp Ser Val Pro Leu Arg Thr Gly He Gly Val Gin Leu Lys 

2130 2135 2140 

Asp lie Lys Val Phe Asn Arg Asp Arg Gin Lys Gly Thr Asn Val Cys 
2145 2150 2155 2160 

Ala Val Ala Asn Gly Gly Cys Gin Gin Leu Cys Leu Tyr Arg Gly Gly 

2165 2170 2175 

Gly Gin Arg Ala Cys Ala Cys Ala His Gly Met Leu Ala Glu Asp Gly 

2180 2185 2190 

Ala Ser Cys Arg . Glu Tyr Ala Gly Tyr Leu Leu Tyr Ser Glu Arg Thr 

2195 2200 2205 

He Leu Lys Ser lie His Leu Ser Asp Glu Arg Asn Leu Asn Ala Pro 

2210 2215 2220 

Val Gin Pro Phe Glu Asp Pro Glu His Met Lys Asn Val He Ala Leu 
2225 2230 2235 2240 

Ala Phe Asp Tyr Arg Ala Gly Thr Ser Pro Gly Thr Pro Asn Arg He 

2245 2250 2255 

Phe Phe Ser Asp He His Phe Gly Asn lie Gin Gin He Asn Asp Asp 

2260 2265 2270 

Gly Ser Gly Arg Thr Thr He Val Glu Asn Val Gly Ser Val Glu Gly 

2275 2280 2285 

Leu Ala Tyr His Arg Gly Trp Asp Thr Leu Tyr Trp Thr Ser Tyr Thr 

2290 2295 2300 

Thr Ser Thr He Thr Arg His Thr Val Asp Gin Thr Arg Pro Gly Ala 
2305 2310 2315 2320 

Phe Glu Arg Glu Thr Val He Thr Met Ser Gly Asp Asp His Pro Arg 

2325 2330 2335 

Ala Phe Val Leu Asp Glu Cys Gin Asn Leu Met Phe Trp Thr Asn Trp 

2340 2345 2350 

Asn Glu Leu His Pro Ser He Met Arg Ala Ala Leu Ser Gly Ala Asn 
2355 2360 2365 
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Val Leu Thr Leu lie Glu Lys Asp lie Arg Thr Pro Asn Gly Leu Ala 

2370 2375 2380 

He Asp His Arg Ala Glu Lys Leu Tyr Phe Ser Asp Ala Thr Leu Asp 
2385 2390 2395 2400 

Lys He Glu Arg Cys Glu Tyr Asp Gly Ser His Arg Tyr Val He Leu 

2405 2410 2415 

Lys Ser Glu Pro Val His Pro Phe Gly Leu Ala Val Tyr Gly Glu His 

2420 2425 2430 

He Phe Trp Thr Asp Trp Val Arg Arg Ala Val Gin Arg Ala Asn Lys 

2435 2440 2445 

Tyr Val Gly Ser Asp Met Lys Leu Leu Arg Val Asp He Pro Gin Gin 

2450 2455 2460 

Pro Met Gly He He Ala Val Ala Asn Asp Thr Asn Ser Cys Glu Leu 
2465 2470 2475 2480 

Ser Pro Cys Arg lie Asn Asn Gly Gly Cys Gin Asp Leu Cys Leu Leu 

2485 2490 2495 

Thr His Gin Gly His Val Asn Cys Ser Cys Arg Gly Gly Arg He Leu 

2500 2505 2510 

Gin Glu Asp Phe Thr Cys Arg Ala Val Asn Ser Ser Cys Arg Ala Gin 

2515 2520 2525 

Asp Glu Phe Glu Cys Ala Asn Gly Glu Cys He Ser Phe Ser Leu Thr 

2530 2535 2540 

Cys Asp Gly Val Ser His Cys Lys Asp Lys Ser Asp Glu Lys Pro Ser 
2545 2550 2555 2560 

Tyr Cys Asn Ser Arg Arg Cys Lys Lys Thr Phe Arg Gin Cys Asn Asn 

2565 2570 2575 

Gly Arg Cys Val Ser Asn Met Leu Trp Cys Asn Gly Val Asp Tyr Cys 

2580 2585 2590 

Gly Asp Gly Ser Asp Glu He Pro Cys Asn Lys Thr Ala Cys Gly Val 

2595 2600 2605 

Gly Glu Phe Arg Cys Arg Asp Gly Ser Cys He Gly Asn Ser Ser Arg 

2610 2615 2620 

Cys Asn Gin Phe Val Asp Cys Glu Asp Ala Ser Asp Glu Met Asn Cys 
2625 2630 2635 2640 

Ser Ala Thr Asp Cys Ser Ser Tyr Phe Arg Leu Gly Val Lys Gly Val 

2645 2650 2655 

Leu Phe Gin Pro Cys Glu Arg Thr Ser Leu Cys Tyr Ala Pro Ser Trp 

2660 2665 2670 

Val Cys Asp Gly Ala Asn Asp Cys Gly Asp Tyr Ser Asp Glu Arg Asp 

2675 2680 2685 

Cys Pro Gly Val Lys Arg Pro Arg Cys Pro Leu Asn Tyr Phe Ala Cys 

2690 2695 2700 

Pro Ser Gly Arg Cys He Pro Met Ser Trp Thr Cys Asp Lys Glu Asp 
2705 2710 2715 2720 

Asp Cys Glu Asn Gly Glu Asp Glu Thr His Cys Asn Lys Phe Cys Ser 

2725 2730 2735 

Glu Ala Gin Phe Glu Cys Gin Asn His Arg Cys He Ser Lys Gin Trp 

2740 2745 2750 

Leu Cys Asp Gly Ser Asp Asp Cys Gly Asp Gly Ser Asp Glu Ala Ala 

2755 * 2760 2765 

His Cys Glu Gly Lys Thr Cys Gly Pro Ser Ser Phe Ser Cys Pro Gly 

2770 2775 2780 

Thr His Val Cys Val Pro Glu Arg Trp Leu Cys Asp Gly Asp Lys Asp 
2785 2790 2795 2800 

Cys Thr Asp Gly Ala Asp Glu Ser Val Thr Ala Gly Cys Leu Tyr Asn 

2805 2810 2815 

Ser Thr Cys Asp Asp Arg Glu Phe Met Cys Gin Asn Arg Leu Cys He 
2820 2825 2830 
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Pro Lys His Phe Val Cys Asp His Asp Arg Asp Cys Ala Asp Gly Ser 

2835 2840 2845 

Asp Glu Ser Pro Glu Cys Glu Tyr Pro Thr Cys Gly Pro Asn Glu Phe 

2850 " 2855 2860 

Arg Cys Ala Asn Gly Arg Cys Leu Ser Ser Arg Gin Trp Glu Cys Asp 
2865 2870 2875 288C 

Gly Glu Asn Asp Cys His Asp His Ser Asp Glu Ala Pro Lys Asn Pro 

2885 2890 2895 

His Cys Thr Ser Pro Glu His Lys Cys Asn Ala Ser Ser Gin Phe Leu 

2900 2905 2910 

Cys Ser Ser Gly Arg Cys Val Ala Glu Ala Leu Leu Cys Asn Gly Gin 

2915 2920 2925 

Asp Asp Cys Gly Asp Gly Ser Asp Glu Arg Gly Cys His Val Asn Glu 

2930 2935 2940 

Cys Leu Ser Arg Lys Leu Ser Gly Cys Ser Gin Asp Cys Glu Asp Leu 
2945 2950 2955 296C 

Lys lie Gly Phe Lys Cys Arg Cys Arg Pro Gly Phe Arg Leu Lys Asp 

2965 2970 2975 

Asp Gly Arg Thr Cys Ala Asp Leu Asp Glu Cys Ser Thr Thr Phe Pro 

2980 2985 2990 

Cys Ser Gin Leu Cys lie Asn Thr His Gly Ser Tyr Lys Cys Leu Cys 

2995 3000 3005 

Val Glu Gly Tyr Ala Pro Arg Gly Gly Asp Pro His Ser Cys Lys Ala 

3010 3015 3020 

Val Thr Asp Glu Glu Pro Phe Leu lie Phe Ala Asn Arg Tyr Tyr Leu 
3025 3030 3035 304C 

Arg Lys Leu Asn Leu Asp Gly Ser Asn Tyr Thr Leu Leu Lys Gin Gly 

3045 3050 3055 

Leu Asn Asn Ala Val Ala Leu Ala Phe Asp Tyr Arg Glu Gin Met lie 

3060 3065 3070 

Tyr Trp Thr Gly Val Thr Thr Gin Gly Ser Met He Arg Arg Met His 

3075 3080 3085 

Leu Asn Gly Ser Asn Val Gin Val Leu His Arg Thr Gly Leu Ser Asn 

3090 3095 3100 

Pro Asp Gly Leu Ala Val Asp Trp Val Gly Gly Asn Leu Tyr Trp Cys 
3105 3110 3115 312( 

Asp Lys Gly Arg Asp Thr He Glu Val Ser Lys Leu Asn Gly Ala Tyr 

3125 3130 3135 

Arg Thr Val Leu Val Ser Ser Gly Leu Arg Glu Pro Arg Ala Leu Val 

3140 3145 3150 

Val Asp Val Gin Asn Gly Tyr Leu Tyr Trp Thr Asp Trp Gly Asp His 

3155 3160 3165 

Ser Leu He Gly Arg He Gly Met Asp Gly Ser Gly Arg Ser lie He 

3170 3175 3180 

Val Asp Thr Lys He Thr Trp Pro Asn Gly Leu Thr Val Asp Tyr Val 
3185 3190 3195 320< 

Thr Glu Arg He Tyr Trp Ala Asp Ala Arg Glu Asp Tyr He Glu Phe 

3205 3210 3215 

Ala Ser Leu Asp Gly Ser Asn Arg His Val Val Leu Ser Gin Asp He 

3220 322$ 3230 

Pro His He Phe Ala Leu Thr Leu Phe Glu Asp Tyr Val Tyr Trp Thr 

3235 3240 3245 

Asp Trp Glu Thr Lys Ser He Asn Arg Ala His Lys Thr Thr Gly Ala 

3250 3255 3260 

Asn Lys Thr Leu Leu He Ser Thr Leu His Arg Pro Met Asp Leu His 
3265 3270 3275 328i 

Val Phe His Ala Leu Arg Gin Pro Asp Val Pro Asn His Pro Cys Lys 



3285 



3290 



3295 
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Val Asn Asn Gly Gly Cys Ser Asn Leu Cya Leu Leu Ser Pro Gly Gly 

3300 3305 3310 

Gly His Lys Cys Ala Cys Pro Thr Asn Phe Tyr Leu Gly Gly Asp Gly 

3315 3320 3325 

Arg Thr Cys Val Ser Asn Cys Thr Ala Ser Gin Phe Val Cys Lys Asn 

3330 3335 3340 

Asp Lys Cys lie Pro Phe Trp Trp Lys Cys Asp Thr Glu Asp Asp Cys 
3345 ' 3350 3355 3360 

Gly Asp His Ser Asp Glu Pro Pro Asp Cys Pro Glu Phe Lys Cys Arg 

3365 3370 3375 

Pro Gly Gin Phe Gin Cys Ser Thr Gly lie Cys Thr Asn Pro Ala Phe 

3380 3385 3390 

lie Cys Asp Gly Asp Asn Asp Cys * Gin Asp Asn Ser Asp Glu Ala Asn 

3395 3400 3405 

Cys Asp lie His Val Cys Leu Pro Ser Gin Phe Lys Cys Thr Asn Thr 

3410 3415 3420 

Asn Arg Cys He Pro Gly He Phe Arg Cys Asn Gly Gin Asp Asn Cys 
3425 3430 3435 3440 

Gly Asp Gly Glu Asp Glu Arg Asp Cys Pro Glu Val Thr Cys Ala Pro 

3445 3450 3455 

Asn Gin Phe Gin Cys Ser He Thr Lys Arg Cys He Pro Arg Val Trp 

3460 3465 3470 

Val Cys Asp Arg Asp Asn His Cys Val Asp Gly Ser Asp Glu Pro Ala 

3475 3480 3485 

Asn Cys Thr Gin Met Thr Cys Gly Val Asp Glu Phe Arg Cys Lys Asp 

3490 3495 3500 

Ser Gly Arg Cys He Pro Ala Arg Trp Lys Cys Asp Gly Glu Asp Asp 
3505 " 3510 3515 3520 

Cys Gly Asp Gly Ser Asp Glu Pro Lys Glu Glu Cys Asp Glu Arg Thr 

3525 3530 3535 

Cys Glu Pro Tyr Gin Phe Arg Cys Lys Asn Asn Arg Cys Val Pro Gly 

3540 3545 3550 

Arg Trp Gin Cys Asp Tyr Asp Asn Asp Cys Gly Asp Asn Ser Asp Glu 

3555 3560 3565 

Glu Ser Cys Thr Pro Arg Pro Cys Ser Glu Ser Glu Phe Phe Cys Ala 

3570 3575 3580 

Asn Gly Arg Cys lie Ala Gly Arg Trp Lys Cys Asp Gly Asp His Asp 
3585 3590 3595 3600 

Cys Ala Asp Gly Ser Asp Glu Lys Asp Cys Thr Pro Arg Cys Asp Met 

3605 3610 3615 

Asp Gin Phe Gin Cys Lys Ser Gly His Cys He Pro Leu Arg Trp Pro 

3620 3625 3630 

Cys Asp Ala Asp Ala Asp Cys Met Asp Gly Ser Asp Glu Glu Ala Cys 

3635 3640 3645 

Gly Thr Gly Val Arg Thr Cys Pro Leu Asp Glu Phe Gin Cys Asn Asn 

3650 3655 3660 

Thr Leu Cys Lys Pro Leu Ala Trp Lys Cys Asp Gly Glu Asp Asp Cys 
3665 3670 3675 3680 

Gly Asp Asn Ser Asp Glu Asn Pro Glu Glu Cys Ala Arg Phe lie Cys 

3685 3690 3695 

Pro Pro Asn Arg Pro Phe Arg Cys Lys Asn Asp Arg Val Cys Leu Trp 

3700 3705 3710 

He Gly Arg Gin Cys Asp Gly Val Asp Asn Cys Gly Asp Gly Thr Asp 

3715 3720 3725 

Glu Glu Asp Cys Glu Pro Pro Thr Ala Gin Asn Pro His Cys Lys Asp 

3730 3735 3740 

Lys Lys Glu Phe Leu Cys Arg Asn Gin Arg Cys Leu Ser Ser Ser Leu 
3745 3750 3755 3760 
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Arg Cys Asn Met Phe Asp Asp Cys Gly Asp Gly Ser Asp Glu Glu Asp 

3765 3770 3775 

Cys Ser lie Asp Pro Lys Leu Thr Ser Cys Ala Thr Asn Ala Ser Met 

3780 3785 3790 

Cys Gly Asp Glu Ala Arg Cys Val Arg Thr Glu Lys Ala Ala Tyr Cys 

3795 3800 3805 

Ala Cys Arg Ser Gly Phe His Thr Val Pro Gly Gin Pro Gly Cys Gin 

3810 3815 3820 

Asp He Asn Glu Cys Leu Arg Phe Gly Thr Cys Ser Gin Leu Trp Asn 
3825 * 3830 3835 3840 

Lys Pro Lys Gly Gly His Leu Cys Ser Cys Ala Arg Asn Phe Met Lys 

3845 3850 3855 

Thr His Asn Thr Cys Lys Ala Glu Gly Ser Glu Tyr Gin Val Leu Tyr 

3860 3865 3870 

He Ala Asp Asp Asn Glu He Arg Ser Leu Phe Pro Gly His Pro His 

3875 3880 3885 

Ser Ala Tyr Glu Gin Thr Phe Gin Gly Asp Glu Ser Val Arg He Asp 

3890 3895 3900 

Ala Met Asp Val His Val Lys Ala Gly Arg Val Tyr Trp Thr Asn Trp 
3905 3910 3915 3920 

His Thr Gly Thr lie Ser Tyr Arg Ser Leu Pro Pro Ala Ala Pro Pro 

3925 3930 3935 

Thr Thr Ser Asn Arg His Arg Arg Gin He Asp Arg Gly Val Thr His 

3940 3945 3950 

Leu Asn He Ser Gly Leu Lys Met Pro Arg Gly He Ala He Asp Trp 

3955 3960 3965 

Val Ala Gly Asn Val Tyr Trp Thr Asp Ser Gly Arg Asp Val He Glu 

3970 3975 3980 

Val Ala Gin Met Lys Gly Glu Asn Arg Lys Thr Leu He Ser Gly Met 
3985 3990 3995 * 4000 

He Asp Glu Pro His Ala He Val Val Asp Pro Leu Arg Gly Thr Met 

4005 4010 4015 

Tyr Trp Ser Asp Trp Gly Asn His Pro Lys lie Glu Thr Ala Ala Met 

4020 4025 4030 

Asp Gly Thr Leu Arg Glu Thr Leu Val Gin Asp Asn He Gin Trp Pro 

4035 4040 4045 

Thr Gly Leu Ala Val Asp Tyr His Asn Glu Arg Leu Tyr Trp Ala Asp 

4050 4055 4060 

Ala Lys Leu Ser Val lie Gly Ser lie Arg Leu Asn Gly Thr Asp Pro 
4065 4070 4075 4080 

He Val Ala Ala Asp Ser Lys Arg Gly Leu Ser His Pro Phe Ser He 

4085 4090 4095 

Asp Val Phe Glu Asp Tyr He Tyr Gly Val Thr Tyr He Asn Asn Arg 

4100 4105 4110 

Val Phe Lys He His Lys Phe Gly His Ser Pro Leu Tyr Asn Leu Thr 

4115 4120 4125 

Gly Gly Leu Ser His Ala Ser Asp Val Val Leu Tyr His Gin His Lys 

4130 4135 4140 

Gin Pro Glu Val Thr Asn Pro Cys Asp Arg Lys Lys Cys Glu Trp Leu 
4145 4150 4155 4160 

Cys Leu Leu Ser Pro Ser Gly Pro Val Cys Thr Cys Pro Asn Gly Lys 

4165 4170 4175 

Arg Leu Asp Asn Gly Thr Cys Val Pro Val Pro Ser Pro Thr Pro Pro 

4180 4185 4190 

Pro Asp Ala Pro Arg Pro Gly Thr Cys Thr Leu Gin Cys Phe Asn Gly 

4195 4200 4205 

Gly Ser Cys Phe Leu Asn Ala Arg Arg Gin Pro Lys Cys Arg Cys Gin 
4210 4215 4220 
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Pro Arg Tyr Thr Gly Asp Lys Cys Glu Leu Asp Gin Cys Trp Glu Tyr 
4225 4230 4235 4240 

Cys His Asn Gly Gly Thr Cys Ala Ala Ser Pro Ser Gly Met Pro Thr 

4245 4250 4255 

Cys Arg Cys Pro Thr Gly Phe Thr Gly Pro Lys Cys Thr Ala Gin Val 

4260 4265 4270 

Cys Ala Gly Tyr Cys Ser Asn Asn Ser Thr Cys Thr Val Asn Gin Gly 

4275 4280 4285 

Asn Gin Pro Gin Cys Arg Cys Leu Pro Gly Phe Leu Gly Asp Arg Cys 

4290 4295 4300 

Gin Tyr Arg Gin Cys Ser Gly Phe Cys Glu Asn Phe Gly Thr Cys Gin 
4305 4310 4315 4320 

Met Ala Ala Asp Gly Ser Arg Gin Cys Arg Cys Thr Val Tyr Phe Glu 

4325 4330 4335 

Gly Pro Arg Cys Glu Val Asn Lys Cys Ser Arg Cys Leu Gin Gly Ala 

4340 4345 4350 

Cys Val Val Asn Lys Gin Thr Gly Asp Val Thr Cys Asn Cys Thr Asp 

4355 4360 4365 

Gly Arg Val Ala Pro Ser Cys Leu Thr Cys lie Asp His Cys Ser Asn 

4370 4375 4380 

Gly Gly Ser Cys Thr Met Asn Ser Lys Met Met Pro Glu Cys Gin Cys 
4385 4390 4395 4400 

Pro Pro His Met Thr Gly Pro Arg Cys Gin Glu Gin Val Val Ser Gin 

4405 4410 4415 

Gin Gin Pro Gly His Met Ala Ser lie Leu lie Pro Leu Leu Leu Leu 

4420 4425 4430 

Leu Leu Leu Leu Leu Val Ala Gly Val Val Phe Trp Tyr Lys Arg Arg 

4435 4440 4445 

Val Arg Gly Ala Lys Gly Phe Gin His Gin Arg Met Thr Asn Gly Ala 

4450 4455 4460 

Met Asn Val Glu lie Gly Asn Pro Thr Tyr Lys Met Tyr Glu Gly Gly 
4465 4470 4475 4480 

Glu Pro Asp Asp Val Gly Gly Leu Leu Asp Ala Asp Phe Ala Leu Asp 

4485 4490 4495 

Pro Asp Lys Pro Thr Asn Phe Thr Asn Pro Val Tyr Ala Thr Leu Tyr 

4500 4505 4510 

Met Gly Gly His Gly Ser Arg His Ser Leu Ala Ser Thr Asp Glu Lys 

4515 4520 4525 

Arg Glu Leu Leu Gly Arg Gly Pro Glu Asp Glu lie Gly Asp Pro Leu 
4530 4535 4540 

Ala 
4545 

<210> 3 
<211> 4577 
<212> DNA 
. <213> Homo sapiens 

<400> 3 

gctacaatcc atctggtctc ctccagctcc ttctttctgc aacatgggga agaacaaact 60 

ccttcatcca agtctggttc ttctcctctt ggtcctcctg cccacagacg cctcagtctc 120 

tggaaaaccg cagtatatgg ttctggtccc ctccctgctc cacactgaga ccactgagaa 180 

gggctgtgtc cttctgagct acctgaatga gacagtgact gtaagtgctt ccttggagtc 240 

tgtcagggga aacaggagcc tcttcactga cctggaggcg gagaatgacg tactccactg 300 

tgtcgccttc gctgtcccaa agtcttcatc caatgaggag gtaatgttcc tcactgtcca 3 60 

agtgaaagga ccaacccaag aatttaagaa gcggaccaca gtgatggtta agaacgagga 420 

cagtctggtc tttgtccaga cagacaaatc aatctacaaa ccagggcaga cagtgaaatt 480 

tcgtgttgtc tccatggatg aaaactttca ccccctgaat gagttgattc cactagtata 540 



15 



WO 01/91787 



PCT/USO 1/18047 



cattcaggat cccaaaggaa atcgcatcgc acaatggcag agtttccagt tagagggtgg 600 
cctcaagcaa ttttcttttc ccctctcatc agagcccttc cagggctcct acaaggtggt 660 
ggtacagaag aaatcaggtg gaaggacaga gcaccctttc accgtggagg aatttgttct 720 
tcccaagttt gaagtacaag taacagtgcc aaagataatc accatcttgg aagaagagat 780 
gaatgtatca gtgtgtggcc tatacacata tgggaagcct gtccctggac atgtgactgt 840 
gagcatttgc agaaagtata gtgacgcttc cgactgccac ggtgaagatt cacaggcttt 900 
ctgtgagaaa ttcagtggac agctaaacag ccatggctgc ttctatcagc aagtaaaaac 960 

caaggtcttc cagctgaaga ggaaggagta tgaaatgaaa cttcacactg aggcccagat 1020 

ccaagaagaa ggaacagtgg tggaattgac tggaaggcag tccagtgaaa tcacaagaac 1080 

cataaccaaa ctctcatttg tgaaagtgga ctcacacttt cgacagggaa ttcccttctt 1140 

tgggcaggtg cgcctagtag atgggaaagg cgtccctata ccaaataaag tcatattcat 1200 

cagaggaaat gaagcaaact attactccaa tgctaccacg gatgagcatg gccttgtaca 1260 

gttctctatc aacaccacca acgttatggg tacctctctt actgttaggg tcaattacaa 1320 

ggatcgtagt ccctgttacg gctaccagtg ggtgtcagaa gaacacgaag aggcacatca 13 80 

cactgcttat cttgtgttct ccccaagcaa gagctttgtc caccttgagc ccatgtctca 1440 

tgaactaccc tgtggccata ctcagacagt ccaggcacat tatattctga atggaggcac 1500 

cctgctgggg ctgaagaagc tctcctttta ttatctgata atggcaaagg gaggcattgt 1560 

ccgaactggg actcatggac tgcttgtgaa gcaggaagac atgaagggcc atttttccat 1620 

ctcaatccct gtgaagtcag acattgctcc tgtcgctcgg ttgctcatct atgctgtttt 1680 

acctaccggg gacgtgattg gggattctgc aaaatatgat gttgaaaatt gtctggccaa 1740 

caaggtggat ttgagcttca gcccatcaca aagtctccca gcctcacacg cccacctgcg 1800 

agtcacagcg gctcctcagt ccgtctgcgc cctccgtgct gtggaccaaa gcgtgctgct 1860 

catgaagcct gatgctgagc tctcggcgtc ctcggtttac aacctgctac cagaaaagga 1920 

cctcactggc ttccctgggc ctttgaatga ccaggacgat gaagactgca tcaatcgtca 1980 

taatgtctat attaatggaa tcacatatac tccagtatca agtacaaatg aaaaggatat 2040 

gtacagcttc ctagaggaca tgggcttaaa ggcattcacc aactcaaaga ttcgtaaacc 2100 

caaaatgtgt ccacagcttc aacagtatga aatgcatgga cctgaaggtc tacgtgtagg 2160 

tttttatgag tcagatgtaa tgggaagagg ccatgcacgc ctggtgcatg ttgaagagcc 2220 

tcacacggag accgtacgaa agtacttccc tgagacatgg atctgggatt tggtggtggt 2280 

aaactcagca ggggtggctg aggtaggagt aacagtccct gacaccatca ccgagtggaa 2340 

ggcaggggcc ttctgcctgt ctgaagatgc tggacttggt atctcttcca ctgcctctct 2400 

ccgagccttc cagcccttct ttgtggagct tacaatgcct tactctgtga ttcgtggaga 2460 

ggccttcaca ctcaaggcca cggtcctaaa ctaccttccc aaatgcatcc gggtcagtgt 2520 

gcagctggaa gcctctcccg ccttccttgc tgtcccagtg gagaaggaac aagcgcctca 25 80 

ctgcatctgt gcaaacgggc ggcaaactgt gtcctgggca gtaaccccaa agtcattagg 2640 

aaatgtgaat ttcactgtga gcgcagaggc actagagtct caagagctgt gtgggactga 2700 

ggtgccttca gttcctgaac acggaaggaa agacacagtc atcaagcctc tgttggttga 2760 

acctgaagga ctagagaagg aaacaacatt caactcccta ctttgtccat caggtggtga 2820 

ggtttctgaa gaattatccc tgaaactgcc accaaatgtg gtagaagaat ctgcccgagc 2880 

ttctgtctca gttttgggag acatattagg ctctgccatg caaaacacac aaaatcttct 2940 

ccagatgccc tatggctgtg gagagcagaa tatggtcctc tttgctccta acatctatgt 3000 

actggattat ctaaatgaaa cacagcagct tactccagag gtcaagtcca aggccattgg 3060 

ctatctcaac actggttacc agagacagtt gaactacaaa cactatgatg gctcctacag 3120 

cacctttggg gagcgatatg gcaggaacca gggcaacacc tggctcacag cctttgttct 3180 

gaagactttt gcccaagctc gagcctacat cttcatcgat gaagcacaca ttacccaagc 3240 

cctcatatgg ctctcccaga ggcagaagga caatggctgt ttcaggagct ctgggtcact 3300 

gctcaacaat gccataaagg gaggagtaga agatgaagtg accctctccg cctatatcac 3360 

catcgccctt ctggagattc ctctcacagt cactcaccct gttgtccgca atgccctgtt 3420 

ttgcctggag tcagcctgga agacagcaca agaaggggac catggcagcc atgtatatac 3480 

caaagcactg ctggcctatg cttttgccct ggcaggtaac caggacaaga ggaaggaagt 3540 

actcaagtca cttaatgagg aagctgtgaa gaaagacaac tctgtccatt gggagcgccc 3600 

tcagaaaccc aaggcaccag tggggcattt ttacgaaccc caggctccct ctgctgaggt 3660 

ggagatgaca tcctatgtgc tcctcgctta tctcacggcc cagccagccc caacctcgga 3720 

ggacctgacc tctgcaacca acatcgtgaa gtggatcacg aagcagcaga atgcccaggg 3780 

cggtttctcc tccacccagg acacagtggt ggctctccat gctctgtcca aatatggagc 3 840 

cgccacattt accaggactg ggaaggctgc acaggtgact atccagtctt cagggacatt 3900 

ttccagcaaa ttccaagtgg acaacaacaa tcgcctgtta ctgcagcagg tctcattgcc 3960 

agagctgcct ggggaataca gcatgaaagt gacaggagaa ggatgtgtct acctccagac 4020 
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ctccttgaaa tacaatattc tcccagaaaa ggaagagttc ccctttgctt taggagtgca 4080 

gactctgcct caaacttgtg atgaacccaa agcccacacc agcttccaaa tctccctaag 4140 

tgtcagttac acagggagcc gctctgectc caacatggcg atcgttgatg tgaagatggt 4200 

ctctggcttc attcccctga agccaacagt gaaaatgctt gaaagatcta accatgtgag 4260 

ccggacagaa gtcagcagca accatgtctt gatttacctt gataaggtgt caaatcagac 4320 

actgagcttg ttcttcacgg ttctgcaaga tgtcccagta agagatctca aaccagccat 4380 

agtgaaagtc tatgattact acgagacgga tgagtttgca atcgctgagt acaatgctcc 4440 

ttgcagcaaa gatcttggaa atgcttgaag accacaaggc tgaaaagtgc tttgctggag 4500 

tcctgttctc tgagctccac agaagacacg tgtttttgta tctttaaaga cttgatgaat 4560 

aaacactttt tctggtc 4577 

<210> 4 

<211> 4422 

<212> DNA 

<213> Homo sapiens 

<400> 4 

atggggaaga acaaactcct tcatccaagt ctggttcttc tcctcttggt cctcctgccc 60 

acagacgcct cagtctctgg aaaaccgcag tatatggttc tggtcccctc cctgctccac 120 

actgagacca ctgagaaggg ctgtgtcctt ctgagctacc tgaatgagac agtgactgta 180 

agtgcttcct tggagtctgt caggggaaac aggagcctct tcactgacct ggaggcggag 240 

aatgacgtac tccactgtgt cgccttcgct gtcccaaagt cttcatccaa tgaggaggta 300 

atgttcctca ctgtccaagt gaaaggacca acccaagaat ttaagaagcg gaccacagtg 360 

atggttaaga acgaggacag tctggtcttt gtccagacag acaaatcaat ctacaaacca 420 

gggcagacag tgaaatttcg tgttgtictcc atggatgaaa actttcaccc cctgaatgag 4 80 

ttgattccac tagtatacat tcaggatccc aaaggaaatc gcatcgcaca atggcagagt 540 

ttccagttag agggtggcct caagcaattt tcttttcccc tctcatcaga gcccttccag 600 

ggctcctaca aggtggtggt acagaagaaa tcaggtggaa ggacagagca ccctttcacc 660 

gtggaggaat ttgttcttcc caagtttgaa gtacaagtaa cagtgccaaa gataatcacc 720 

atcttggaag aagagatgaa tgtatcagtg tgtggcctat acacatatgg gaagcctgtc 780 

cctggacatg tgactgtgag catttgcaga aagtatagtg acgcttccga ctgccacggt 840 

gaagattcac aggctttctg tgagaaattc agtggacagc taaacagcca tggctgcttc 900 

tatcagcaag taaaaaccaa ggtcttccag ctgaagagga aggagtatga aatgaaactt 960 

cacactgagg cccagatcca agaagaagga acagtggtgg aattgactgg aaggcagtcc 102 0 

agtgaaatca caagaaccat aaccaaactc tcatttgtga aagtggactc acactttcga 1080 

cagggaattc ccttctttgg gcaggtgcgc ctagtagatg ggaaaggcgt ccctatacca 1140 

aataaagtca tattcatcag aggaaatgaa gcaaactatt actccaatgc taccacggat 1200 

gagcatggcc ttgtacagtt ctctatcaac accaccaacg ttatgggtac ctctcttact 1260 

gttagggtca attacaagga tcgtagtccc tgttacggct accagtgggt gtcagaagaa 132 0 

cacgaagagg cacatcacac tgcttatctt gtgttctccc caagcaagag ctttgtccac 1380 

cttgagccca tgtctcatga actaccctgt ggccatactc agacagtcca ggcacattat 1440 

attctgaatg gaggcaccct gctggggctg aagaagctct ccttttatta tctgataatg 1500 

gcaaagggag gcattgtccg aactgggact catggactgc ttgtgaagca ggaagacatg 1560 

aagggccatt tttccatctc aatccctgtg aagtcagaca ttgctcctgt cgctcggttg 1620 

ctcatctatg ctgttttacc taccggggac gtgattgggg attctgcaaa atatgatgtt 1680 

gaaaattgtc tggccaacaa ggtggatttg agcttcagcc catcacaaag tctcccagcc 1740 

tcacacgccc acctgcgagt cacagcggct cctcagtccg tctgcgccct ccgtgctgtg 1800 

gaccaaagcg tgctgctcat gaagcctgat gctgagctct cggcgtcctc ggtttacaac 1860 

ctgctaccag aaaaggacct cactggcttc cctgggcctt tgaatgacca ggacgatgaa 1920 

gactgcatca atcgtcataa tgtctatatt aatggaatca catatactcc agtatcaagt 1980 

acaaatgaaa aggatatgta cagcttccta gaggacat^g gcttaaaggc attcaccaac 2040 

tcaaagattc gtaaacccaa aatgtgtcca cagcttcaac agtatgaaat gcatggacct 2100 

gaaggtctac gtgtaggttt ttatgagtca gatgtaatgg gaagaggcca tgcacgcctg 2160 

gtgcatgttg aagagcctca cacggagacc gtacgaaagt acttccctga gacatggatc 2220 

tgggatttgg tggtggtaaa ctcagcaggg gtggctgagg taggagtaac agtccctgac 2280 

accatcaccg agtggaaggc aggggccttc tgcctgtctg aagatgctgg acttggtatc 2340 

tcttccactg cctctctccg agccttccag cccttctttg tggagcttac aatgccttac 2400 

tctgtgattc gtggagaggc cttcacactc aaggccacgg tcctaaacta ccttcccaaa 2460 
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tgcatccggg tcagtgtgea gctggaagcc tctcccgcct tccttgetgt cccagtggag 2520 

aaggaacaag cgcctcactg catctgtgca aacgggcggc aaactgtgtc ctgggcagta 2580 

accccaaagt cattaggaaa tgtgaatttc actgtgagcg cagaggcact agagtctcaa 2640 

gagctgtgtg ggactgaggt gccttcagtt cctgaacacg gaaggaaaga cacagtcatc 2700 

aagcctctgt tggttgaacc tgaaggacta gagaaggaaa caacattcaa ctccctactt 2760 

tgtccatcag gtggtgaggt ttctgaagaa ttatccctga aactgccacc aaatgtggta 2820 

gaagaatctg cccgagcttc tgtctcagtt ttgggagaca tattaggctc tgccatgcaa 2880 

aacacacaaa atcttctcca gatgccctat ggctgtggag agcagaatat ggtcctcttt 2940 

gctcctaaca tctatgtact ggattatcta aatgaaacac agcagcttac tccagaggtc 3000 

aagtccaagg ccattggcta tctcaacact ggttaccaga gacagttgaa ctacaaacac 3060 

tatgatggct cctacagcac ctttggggag cgatatggca ggaaccaggg caacacctgg 3120 

ctcacagcct ttgttctgaa gacttttgcc caagctcgag cctacatctt catcgatgaa 3180 

gcacacatta cccaagccct catatggctc tcccagaggc agaaggacaa tggctgtttc 3240 

aggagctctg ggtcactgct caacaatgcc ataaagggag gagtagaaga tgaagtgacc 3300 

ctctccgcct atatcaccat cgcccttctg gagattcctc tcacagtcac tcaccctgtt 3360 

gtccgcaatg ccctgttttg cctggagtca gcctggaaga cagcacaaga aggggaccat 3420 

ggcagccatg tatataccaa agcactgctg gcctatgctt ttgccctggc aggtaaccag 3480 

gacaagagga aggaagtact caagtcactt aatgaggaag ctgtgaagaa agacaactct 3540 

gtccattggg agcgccctca gaaacccaag gcaccagtgg ggcattttta cgaaccccag 3600 

gctccctctg ctgaggtgga gatgacatcc tatgtgctcc tcgcttatct cacggcccag 3660 

ccagccccaa cctcggagga cctgacctct gcaaccaaca tcgtgaagtg gatcacgaag 3720 

cagcagaatg cccagggcgg tttctcctcc acccaggaca cagtggtggc tctccatgct 3780 

ctgtccaaat atggagccgc cacatttacc aggactggga aggctgcaca ggtgactatc 3840 

cagtcttcag ggacattttc cagcaaattc caagtggaca acaacaatcg cctgttactg 3900 

cagcaggtct cattgccaga gctgcctggg gaatacagca tgaaagtgac aggagaagga 3960 

tgtgtctacc tccagacctc cttgaaatac aatattctcc cagaaaagga agagttcccc 4020 

tttgctttag gagtgcagac tctgcctcaa acttgtgatg aacccaaagc ccacaccagc 4 080 

ttccaaatct ccctaagtgt cagttacaca gggagccgct ctggetccaa catggcgatc 4140 

gttgatgtga agatggtctc tggcttcatt cccctgaagc caacagtgaa aatgcttgaa 4200 

agatctaacc atgtgagccg gacagaagtc agcagcaacc atgtcttgat ttaccttgat 4260 

aaggtgtcaa atcagacact gagcttgttc ttcacggttc tgcaagatgt cccagtaaga 4320 

gatctcaaac cagccatagt gaaagtctat gattactacg agacggatga gtttgcaatc 43 80 

gctgagtaca atgctccttg cagcaaagat cttggaaatg ct 4422 

<210> 5 

<211> 1474 

<212> PRT 

<213> Homo sapiens 





<400> 


5 














Met 


Gly 


Lys 


Asn 


Lys 


Leu Leu 


His 


Pro 


Ser 


Leu Val Leu Leu Leu Leu 


1 






5 








10 


15 


Val 


Leu 


Leu 


Pro 


Thr 


Asp Ala 


Ser 


Val 


Ser 


Gly Lys Pro Gin Tyr Met 








20 








25 




30 


Val 


Leu 


Val 


Pro 


Ser 


Leu Leu 


His 


Thr 


Glu 


Thr Thr Glu Lys Gly Cys 






35 








40 






45 


Val 


Leu 


Leu 


Ser 


Tyr 


Leu Asn 


Glu 


Thr 


Val 


Thr Val Ser Ala Ser Leu 




50 






55. 








60 


Glu 


Ser 


Val 


Arg 


Gly 


Asn Arg 


Ser 


Leu 


Phe 


Thr Asp Leu Glu Ala Glu 


65 






70 








75 ' 80 


Asn 


Asp 


Val 


Leu 


His 


Cys Val 


Ala 


Phe 


Ala 


Val Pro Lys Ser Ser Ser 








85 






90 


95 


Asn 


Glu 


Glu 


Val 


Met 


Phe Leu 


Thr 


Val 


Gin 


Val Lys Gly Pro Thr Gin 








100 








105 




110 


Glu 


Phe 


Lys 


Lys 


Arg 


Thr Thr 


Val 


Met 


Val 


Lys Asn Glu Asp Ser Leu 






115 








120 






125 


Val 


Phe 


Val 


Gin 


Thr 


Asp Lys 


Ser 


He 


Tyr 


Lys Pro Gly Gin Thr Val 




130 








135 








140 
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Lys Phe Arg Val Val Ser Met Asp Glu Asn Phe His Pro Leu Asn Glu 
145 150 155 160 

Leu He Pro Leu Val Tyr lie Gin Asp Pro Lys Gly Asn Arg He Ala 

165 170 175 

Gin Trp Gin Ser Phe Gin Leu Glu Gly Gly Leu Lys Gin Phe Ser Phe 

180 185 190 

Pro Leu Ser Ser Glu Pro Phe Gin Gly Ser Tyr Lys Val Val Val Gin 

195 200 205 

Lys Lys Ser Gly Gly Arg Thr Glu His Pro Phe Thr Val Glu Glu Phe 

210 215 220 

Val Leu Pro Lys Phe Glu Val Gin Val Thr Val Pro Lys He He Thr 
225 230 235 240 

lie Leu Glu Glu Glu Met Asn Val Ser Val Cys Gly Leu Tyr Thr Tyr 

245 250 255 

Gly Lys Pro Val Pro Gly His Val Thr Val Ser lie Cys Arg Lys Tyr 

260 265 270 

Ser Asp Ala Ser Asp Cys His Gly Glu Asp Ser Gin Ala Phe Cys Glu 

275 280 285 

Lys Phe Ser Gly Gin Leu Asn Ser His Gly Cys Phe Tyr Gin Gin Val 

290 295 300 

Lys Thr Lys Val Phe Gin Leu Lys Arg Lys Glu Tyr Glu Met Lys Leu 
305 310 315 320 

His Thr Glu Ala Gin He Gin Glu Glu Gly Thr Val Val Glu Leu Thr 

325 330 335 

Gly Arg Gin Ser Ser Glu He Thr Arg Thr He Thr Lys Leu Ser Phe 

340 345 350 

Val Lys Val Asp Ser His Phe Arg Gin Gly He Pro Phe Phe Gly Gin 

355 360 365 

Val Arg Leu Val Asp Gly Lys Gly Val Pro He Pro Asn Lys Val He 

370 375 380 

Phe He Arg Gly Asn Glu Ala Asn Tyr Tyr Ser Asn Ala Thr Thr Asp 
385 390 395 = 400 

Glu His Gly Leu Val Gin Phe Ser He Asn Thr Thr Asn Val Met Gly 

405 410 415 

Thr Ser Leu Thr Val Arg Val Asn Tyr Lys Asp Arg Ser Pro Cys Tyr 

420 425 430 

Gly Tyr Gin Trp Val Ser Glu Glu His Glu Glu Ala His His Thr Ala 

435 440 445 

Tyr Leu Val Phe Ser Pro Ser Lys Ser Phe Val His Leu Glu Pro Met 

450 455 460 

Ser His Glu Leu Pro Cys Gly His Thr Gin Thr Val Gin Ala His Tyr 
465 470 475 480 

He Leu Asn Gly Gly Thr Leu Leu Gly Leu Lys Lys Leu Ser Phe Tyr 

485 490 495 

Tyr Leu He Met Ala Lys Gly Gly He Val Arg Thr Gly Thr His Gly 

500 505 510 

Leu Leu Val Lys Gin Glu Asp Met Lys Gly His Phe Ser He Ser He 

515 520 525 

Pro Val Lys Ser Asp He Ala Pro Val Ala Arg Leu Leu He Tyr Ala 

530 535 540 

Val Leu Pro Thr Gly Asp Val He Gly Asp Ser Ala Lys Tyr Asp Val 
545 550 555 560 

Glu Asn Cys Leu Ala Asn Lys Val Asp Leu Ser Phe Ser Pro Ser Gin 

565 570 575 

Ser Leu Pro Ala Ser His Ala His Leu Arg Val Thr Ala Ala Pro Gin 

580 585 590 

Ser Val Cys Ala Leu Arg Ala Val Asp Gin Ser Val Leu Leu Met Lys 
595 600 605 
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Pro Asp Ala Glu Leu Ser Ala Ser Ser Val Tyr Asn Leu Leu Pro Glu 

610 615 620 

Lys Asp Leu Thr Gly Phe Pro Gly Pro Leu Asn Asp Gin Asp Asp Glu 
625 630 635 640 

Asp Cys lie Asn Arg His Asn Val Tyr lie Asn Gly He Thr Tyr Thr 

645 650 655 

Pro Val Ser Ser Thr Asn Glu Lys Asp Met Tyr Ser Phe Leu Glu Asp 

660 665 670 

Met Gly Leu Lys Ala Phe Thr Asn Ser Lys He Arg Lys Pro Lys Met 

675 680 685 

Cys Pro Gin Leu Gin Gin Tyr Glu Met His Gly Pro Glu Gly Leu Arg 

690 695 700 

Val Gly Phe Tyr Glu Ser Asp Val Met Gly Arg Gly His Ala Arg Leu 
705 710 715 720 

Val His Val Glu Glu Pro His Thr Glu Thr Val Arg Lys Tyr Phe Pro 

725 730 735 

Glu Thr Trp He Trp Asp Leu Val Val Val Asn Ser Ala Gly Val Ala 

740 745 750 

Glu Val Gly Val Thr Val Pro Asp Thr He Thr Glu Trp Lys Ala Gly 

755 760 765 

Ala Phe Cys Leu Ser Glu Asp Ala Gly Leu Gly He Ser Ser Thr Ala 

770 775 780 

Ser Leu Arg Ala Phe Gin Pro Phe Phe Val Glu Leu Thr Met Pro Tyr 
785 790 795 800 

Ser Val He Arg Gly Glu Ala Phe Thr Leu Lys Ala Thr Val Leu Asn 

805 810 815 

Tyr Leu Pro Lys Cys He Arg Val Ser Val Gin Leu Glu Ala Ser Pro 

820 825 83 0 

Ala Phe Leu Ala Val Pro Val Glu Lys Glu Gin Ala Pro His Cys He 

835 840 845 

Cys Ala Asn Gly Arg Gin Thr Val Ser Trp Ala Val Thr Pro Lys Ser 

850 855 860 

Leu Gly Asn Val Asn Phe Thr Val Ser Ala Glu Ala Leu Glu Ser Gin 
865 870 875 880 

Glu Leu Cys Gly Thr Glu Val Pro Ser Val Pro Glu His Gly Arg Lys 

885 890 895 

Asp Thr Val He Lys Pro Leu Leu Val Glu Pro Glu Gly Leu Glu Lys 

900 905 910 

Glu Thr Thr Phe Asn Ser Leu Leu Cys Pro Ser Gly Gly Glu Val Ser 

915 920 925 

Glu Glu Leu Ser Leu Lys Leu Pro Pro Asn Val Val Glu Glu Ser Ala 

930 935 940 

Arg Ala Ser Val Ser Val Leu Gly Asp He Leu Gly Ser Ala Met Gin 
945 950 955 960 

Asn Thr Gin Asn Leu Leu Gin Met Pro Tyr Gly Cys Gly Glu Gin Asn 

965 970 975 

Met Val Leu Phe Ala Pro Asn lie Tyr Val Leu Asp Tyr Leu Asn Glu 

980 985 990 

Thr Gin Gin Leu Thr Pro Glu Val Lys Ser Lys Ala He Gly Tyr Leu 

995 1000 1005 

Asn Thr Gly Tyr Gin Arg Gin Leu Asn Tyr Lys His Tyr Asp Gly Ser 

1010 1015 1020 

Tyr Ser Thr Phe Gly Glu Arg Tyr Gly Arg Asn Gin Gly Asn Thr Trp 
1025 1030 1035 1040 

Leu Thr Ala Phe Val Leu Lys Thr Phe Ala Gin Ala Arg Ala Tyr He 

1045 1050 1055 

Phe He Asp Glu Ala His He Thr Gin Ala Leu He Trp Leu Ser Gin 
1060 1065 1070 
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Arg Gin Lys Asp Asn Gly Cys Phe Arg Ser Ser Gly Ser Leu Leu Asn 

1075 1080 1085 

Asn Ala lie Lys Gly Gly Val Glu Asp Glu Val Thr Leu Ser Ala Tyr 

1090 1095 1100 

He Thr He Ala Leu Leu Glu He Pro Leu Thr Val Thr His Pro Val 
1105 1110 1115 1120 

Val Arg Asn Ala Leu Phe Cys Leu Glu Ser Ala Trp Lys Thr Ala Gin 

1125 1130 1135 

Glu Gly Asp His Gly Ser His Val Tyr Thr Lys Ala Leu Leu Ala Tyr 

1140 1145 1150 

Ala Phe Ala Leu Ala Gly Asn Gin Asp Lys Arg Lys Glu Val Leu Lys 

1155 1160 1165 

Ser Leu Asn Glu Glu Ala Val Lys Lys Asp Asn Ser Val His Trp Glu 

1170 1175 1180 

Arg Pro Gin Lys Pro Lys Ala Pro Val Gly His Phe Tyr Glu Pro Gin 
1185 1190 1195 1200 

Ala Pro Ser Ala Glu Val Glu Met Thr Ser Tyr Val Leu Leu Ala Tyr 

1205 1210 1215 

Leu Thr Ala Gin Pro Ala Pro Thr Ser Glu Asp Leu Thr Ser Ala * Thr 

1220 1225 1230 

Asn He Val Lys Trp lie Thr Lys Gin Gin Asn Ala Gin Gly Gly Phe 

1235 1240 1245 

Ser Ser Thr Gin Asp Thr Val Val Ala Leu His Ala Leu Ser Lys Tyr 

1250 1255 1260 

Gly Ala Ala Thr Phe Thr Arg Thr Gly Lys Ala Ala Gin Val Thr He 
1265 1270 1275 1280 

Gin Ser Ser Gly Thr Phe Ser Ser Lys Phe Gin Val Asp Asn Asn Asn 

1285 1290 1295 

Arg Leu Leu Leu Gin Gin Val Ser Leu Pro Glu Leu Pro Gly Glu Tyr 

1300 1305 1310 

Ser Met Lys Val Thr Gly Glu Gly Cys Val Tyr Leu Gin Thr Ser Leu 

1315 1320 1325 

Lys Tyr Asn lie Leu Pro Glu Lys Glu Glu Phe Pro Phe Ala Leu Gly 

1330 1335 1340 

Val Gin Thr Leu Pro Gin Thr Cys Asp Glu Pro Lys Ala His Thr Ser 
1345 1350 1355 1360 

Phe Gin He Ser Leu Ser Val Ser Tyr Thr Gly Ser Arg Ser Ala Ser 

1365 1370 1375 

Asn Met Ala He Val Asp Val Lys Met Val Ser Gly Phe He Pro Leu 

1380 1385 1390 

Lys Pro Thr Val Lys Met Leu Glu Arg Ser Asn His Val Ser Arg Thr 

1395 1400 1405 

Glu Val Ser Ser Asn His Val Leu He Tyr Leu Asp Lys Val Ser Asn 

1410 1415 1420 

Gin Thr Leu Ser Leu Phe Phe Thr Val Leu Gin Asp Val Pro Val Arg 
1425 1430 1435 1440 

Asp Leu Lys Pro Ala He Val Lys Val Tyr Asp Tyr Tyr Glu Thr Asp 

1445 1450 1455 

Glu Phe Ala He Ala Glu Tyr Asn Ala Pro Cys Ser Lys Asp Leu Gly 
1460 1465 1470 

Asn Ala 
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